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GENES AND POLYMORPHISMS ASSOCIATED 
WITH CARDIOVASCULAR DISEASE AND THEIR 
USE 

RELATED APPLICATIONS 

[0001] This application is a divisional application of 
copending U.S. patent application Ser. No. 09/802,640, filed 
Mar. 9, 2001, to Andreas Braun, Aruna Bansal and Patrick 
Kleyn, entitled "GENES AND POLYMORPHISMS ASSO- 
CIATED WITH CARDIOVASCULAR DISEASE AND 
THEIR USE." The benefit of priority to this application is 
claimed and the subject matter of the application is incor- 
porated herein in its entirety. 

FIELD OF THE INVENTION 

[0002] The field of the invention involves genes and 
polymorphisms of these genes that are associated with 
development of cardiovascular disease. Methods that use 
polymorphic markers for prognosticating, profiling drug 
response and drug discovery are provided. 

BACKGROUND OF THE INVENTION 

[0003] Diseases in all organisms have a genetic compo- 
nent, whether inherited or resulting from the body's 
response to environmental stresses, such as viruses and 
toxins. The ultimate goal of ongoing genomic research is to 
use this information to develop new ways to identify, treat 
and potentially cure these diseases. The first step has been to 
screen disease tissue and identify genomic changes at the 
level of individual samples. The identification of these 
"disease" markers has then fueled the development and 
commercialization of diagnostic tests that detect these errant 
genes or polymorphisms. With the increasing numbers of 
genetic markers, including single nucleotide polymorphisms 
(SNPs), microsatellites, tandem repeats, newly mapped 
introns and exons, the challenge to the medical and phar- 
maceutical communities is to identify genotypes which not 
only identify the disease but also follow the progression of 
the disease and are predictive of an organism's response to 
treatment. 

[0004] Polymorphisms 

[0005] Polymorphisms have been known since 1901 with 
the identification of blood types. In the 1950's they were 
identified on the level of proteins using large population 
genetic studies. In the 1980's and 1 990' s many of the known 
protein polymorphisms were correlated with genetic loci on 
genomic DNA. For example, the gene dose of the apolipo- 
protein E type 4 allele was correlated with the risk of 
Alzheimer's disease in late onset families (see, e.g., Corder 
et al. (1993) Science 261: 921-923; mutation in blood 
coagulation factor V was associated with resistance to 
activated protein C (see, e.g., Bertina et al. (1994) Nature 
369:64-67); resistance to HIV-1 infection has been shown in 
Caucasian individuals bearing mutant alleles of the CCR-5 
chemokine receptor gene (see, e.g., Samson et al. (1996) 
Nature 382:722-725); and a hypermutable tract in antigen 
presenting cells (APC, such as macrophages), has been 
identified in familial colorectal cancer in individuals of 
Ashkenzi jewish background (see, e.g., Laken et al. (1997) 
Nature Genet. 17:79-83). There may be more than three 
million polymorphic sites in the human genome. Many have 
been identified, but not yet characterized or mapped or 


associated with a disease. Polymorphisms of the genome can 
lead to altered gene function, protein function or mRNA 
instability. To identify hose polymorphisms that have clini- 
cal relevance is the goal of a world-wide scientific effort. 
Discovery of such polymorphisms will have a fundamental 
impact on the identification and development of diagnostics 
and drug discovery. 

[0006] Single Nucleotide Polymorphisms (SNPs) 

[0007] Much of the focus of genomics has been in the 
identification of SNPs, which are important for a variety of 
reasons. They allow indirect testing (association of haplo- 
types) and direct testing (functional variants). They are the 
most abundant and stable genetic markers. Common dis- 
eases are best explained by common genetic alterations, and 
the natural variation in the human population aids in under- 
standing disease, therapy and environmental interactions. 

[0008] The organization of SNPs in the primary sequence 
of a gene into one of the limited number of combinations 
that exist as units of inheritance is termed a haplotype. Each 
haplotype therefore contains significantly more information 
than individual unorganized polymorphisms and provides an 
accurate measurement of the genomic variation in the two 
chromosomes of an individual. While it is well-established 
that many diseases are associated with specific variation in 
gene sequences and there are examples in which individual 
polymorphisms act as genetic markers for a particular phe- 
notype, in other cases an individual polymorphism may be 
found in a variety of genomic backgrounds and therefore 
shows no definitive coupling between the polymorphism 
and the phenotype. In these instances, the observed haplo- 
type and its frequency of occurrence in various genotypes 
will provide a better genetic marker for the phenotype. 

[0009] Although risk factors for the development of car- 
diovascular disease are known, such as high serum choles- 
terol levels and low serum high density lipoprotein (HDL) 
levels, the genetic basis for the manifestation of these 
phenotypes remains unknown. An understanding of the 
genes that are responsible for controlling cholesterol and 
HDL levels, along with useful genetic markers and muta- 
tions in these genes that affect these phenotypes, will allow 
for detection of a predisposition for these risk factors and/or 
cardiovascular disease and the development of therapeutics 
to modulate such alterations. Therefore, it is an object herein 
to provide methods for using polymorphic markers to detect 
a predisposition to the manifestation of high serum choles- 
terol, low serum HDL and cardiovascular disease. The 
ultimate goals are the elucidation of pathological pathways, 
developing new diagnostic assays, determining genetic pro- 
files for positive responses to therapeutic drugs, identifying 
new potential drug targets and identifying new drug candi- 
dates. 

SUMMARY OF THE INVENTION 

[0010] A database of twins was screened for individuals 
which exhibit high or low levels of serum cholesterol or 
HDL. Using a full genome scanning approach, SNPs present 
in DNA samples from these individuals were examined for 
alleles that associate with either high levels of cholesterol or 
low levels of HDL. This lead to the discovery of the 
association of the cytochrome C oxidase subunit VIb 
(COX6B) gene and the N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene with these risks factors for 
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developing cardiovascular disease. Specifically, a previously 
undetermined association of an allelic variant at nucleotide 
86 of the COX6B gene and high serum cholesterol levels has 
been discovered. In addition, it has been discovered that an 
allelic variant at nucleotide 2577 of the GPI-1 gene is 
associated with low serum HDL levels. There was no 
previously known association between these two genes and 
risk factors related to cardiovascular disease. 

[0011] Methods are provided for detecting the presence or 
absence of at least one allelic variant associated with high 
cholesterol, low HDL and/or cardiovascular disease by 
detecting the presence or absence of at least one allelic 
variant of the COX6B gene or the GPI-1 gene, individually 
or in combination with one or more allelic variants of other 
genes associated with cardiovascular disease. 

[0012] Also provided are methods for indicating a predis- 
position to manifesting high serum cholesterol, low serum 
HDL and/or cardiovascular disease based on detecting the 
presence or absence of at least one allelic variant of the 
COX6B or GPI-1 genes, alone or in combination with one 
or more allelic variants of other genes associated with 
cardiovascular disease. These methods, referred to as hap- 
lotyping, are based on assaying more than one polymor- 
phism of the COX6B and/or GPI-1 genes. One or more 
polymorphisms of other genes associated with cardiovascu- 
lar disease may also be assayed at the same time. A collec- 
tion of allelic variants of one or more genes may be more 
informative than a single allelic variant of any one gene. A 
single polymorphism of a collection of polymorphisms 
present in the COX6B and/or GPI-1 genes and in other genes 
associated with cardiovascular disease may be assayed indi- 
vidually or the collection may be assayed simultaneously 
using a multiplex assay method. 

[0013] Also provided are microarrays comprising a probe 
selected from among an oligonucleotide complementary to 
a polymorphic region surrounding position 86 of the sense 
strand of the COX6B gene coding sequence; an oligonucle- 
otide complementary to a polymorphic region surrounding 
the position of the antisense strand of COX6B correspond- 
ing to position 86 of the sense strand of the COX6B gene 
coding sequence; an oligonucleotide complementary to a 
polymorphic region surrounding position 2577 of the sense 
strand of the GPI-1 gene; and an oligonucleotide comple- 
mentary to a polymorphic region surrounding the position of 
the antisense strand of GPI-1 corresponding to position 2577 
of the sense strand of the GPI-1 gene. Microarrays are well 
known and can be made, for example, using methods set 
forth in U.S. Pat. Nos. 5,837,832; 5,858,659; 6,043,136; 
6,043,031 and 6,156,501. 

[0014] Further provided are methods of utilizing allelic 
variants of the COX6B or GPI-1 gene individually or 
together with one or more allelic variants of other genes 
associated with cardiovascular disease to predict a subject's 
response to a biologically active agent that modulates serum 
cholesterol, serum HDL, or a cardiovascular drug. 

[0015] Also provided are methods to screen candidate 
biologically active agents for modulation of cholesterol, 
HDL or other factors associated with cardiovascular disease. 
These methods utilize cells or transgenic animals containing 
one or more allelic variants of the COX6B gene and/or the 
GPI-1 gene alone or in combination with allelic variants of 
one or more other genes associated with cardiovascular 


disease. Such animals should exhibit high cholesterol, low 
HDL or other known phenotypes associated with cardiovas- 
cular disease. Also, provided are methods to construct 
transgenic animals that are useful as models for cardiovas- 
cular disease by using one or more allelic variants of the 
COX6B gene and/or the GPI-1 gene alone or in combination 
with allelic variants of one or more other genes associated 
with cardiovascular disease. 

[0016] Further provided are combinations of probes and 
primers and kits for predicting a predisposition to high 
serum cholesterol, low HDL levels and/or cardiovascular 
disease. In particular, combinations and kits comprise 
probes or primers which are capable of hybridizing adjacent 
to or at polymorphic regions of the COX6B and/or GPI-1 
gene. The combinations and kits can also contain probes or 
primers which are capable of hybridizing adjacent to or at 
polymorphic regions of other genes associated with cardio- 
vascular disease. The kits also optionally contain instruc- 
tions for carrying out assays, interpreting results and for 
aiding in diagnosing a subject as having a predisposition 
towards developing high serum cholesterol, low HDL levels 
and/or cardiovascular disease. Combinations and kits are 
also provided for predicting a subject's response to a thera- 
peutic agent directed toward modulating cholesterol, HDL, 
or another phenotype associated with cardiovascular dis- 
ease. Such combinations and kits comprise probes or prim- 
ers as described above. 

[0017] In particular for the methods, combinations, kits 
and arrays described above, the polymorphisms are SNPs. 
The detection or identification is of a T nucleotide at position 
86 of the sense strand of the COX6B gene coding sequence 
or the detection or identification of an A nucleotide at the 
corresponding position in the antisense strand of the COX6B 
gene coding sequence. Also embodied is the detection or 
identification of an A nucleotide at position 2577 of the sense 
strand of the GPI-1 gene or the detection or identification of 
a T nucleotide at the corresponding position in the antisense 
strand of the GPI-1 gene. In addition to the SNPs discussed 
above, other polymorphisms of the COX6B and GPI-1 
genes can be assayed for association with high cholesterol or 
low HDL, respectively, and utilized as disclosed above. 

[0018] Other genes containing allelic variants associated 
with high serum cholesterol, low HDL and/or cardiovascular 
disease, include, but are not limited to: cholesterol ester 
transfer protein, plasma (CETP); apolipoprotein A-IV (APO 
A4); apolipoprotein A-I (APO Al); apolipoprotein E (APO 
E); apolipoprotein B (APO B); apolipoprotein C-III (APO 
C3); a gene encoding lipoprotein lipase (LPL); ATP-binding 
cassette transporter (ABC 1); paraoxonase 1 (PON 1); 
paraoxonase 2 (PON 2); 5, 10-methylenetetrahydro folate r 
reductase (MTHFR); a gene encoding hepatic lipase, E-se- 
lectin, G protein beta 3 subunit, and angiotensin II type 1 
receptor gene. 

[0019] The detection of the presence or absence of an 
allelic variant can utilize, but are not limited to, methods 
such as allele specific hybridization, primer specific exten- 
sion, oligonucleotide ligation assay, restriction enzyme site 
analysis and single-stranded conformation polymorphism 
analysis. 

[0020] In particular, primers utilized in primer specific 
extension hybridize adjacent to nucleotide 86 of the COX6B 
gene or nucleotide 2577 of the GPI-1 gene or the corre- 
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sponding positions on the antisense strand (numbers refer to 
GenBank sequences, see pages 15-17). A primer can be 
extended in the presence of at least one dideoxy nucleotide, 
particularly ddG, or two dideoxynucleotides, particularly 
ddG and ddC. Preferably, detection of extension products is 
by mass spectrometry. Detection of allelic variants can also 
involve signal moieties such as radioisotopes, enzymes, 
antigens, antibodies, spectrophotometric reagents, chemilu- 
minescent reagents, fluorescent reagents and other light 
producing reagents. 

[0021] Other probes and primers useful for the detection 
of allelic variants include those which hybridize at or 
adjacent to the SNPs described in Tables 1-3 and specifically 
those that comprise SEQ ID NOs.: 5, 10, 43, 48, 53, 58, 63, 
68, 73, 78, S3, 88, 93, 98, 103, 108, 113, and 118. 

DESCRIPTION OF THE DRAWINGS 
[0022] FIG. 1 depicts the allelic frequency and genotype 
for pools and individually determined samples of blood from 
individuals having low cholesterol levels and those with 
high cholesterol levels. 

[0023] FIG. 2 depicts the allelic frequency and genotype 
for pools and individually determined samples of blood from 
individuals having high HDL levels and those with low HDL 
levels. 

DETAILED DESCRIPTION OF PREFERRED 
EMBODIMENTS 

[0024] A. Definitions 

[0025] Unless defined otherwise, all technical and scien- 
tific terms used herein have the same meaning as is com- 
monly understood by one of skill in the art to which this 
invention belongs. All patents, patent applications and pub- 
lications referred to throughout the disclosure herein are, 
unless noted otherwise, incorporated by reference in their 
entirety. In the event that there are a plurality of definitions 
for terms herein, those in this section prevail. 

[0026] As used herein, sequencing refers to the process of 
determining a nucleotide sequence and can be performed 
using any method known to those of skill in the art. For 
example, if a polymorphism is identified or known, and it is 
desired to assess its frequency or presence in nucleic acid 
samples taken from the subjects that comprise the database, 
the region of interest from the samples can be isolated, such 
as by PCR or restriction fragments, hybridization or other 
suitable method known to those of skill in the art, and 
sequenced. For purposes herein, sequencing analysis is 
preferably effected using mass spectrometry (see, e.g., U.S. 
Pat. Nos. 5,547,835, 5,622,824, 5,851,765, and 5,928,906). 
Nucleic acids can also be sequenced by hybridization (see, 
e.g., U.S. Pat. Nos. 5,503,980, 5,631,134, 5,795,714) and 
including analysis by mass spectrometry (see, U.S. applica- 
tion Ser. Nos. 08/419,994 and 09/395,409). Alternatively, 
sequencing may be performed using other known methods, 
such as set forth in U.S. Pat. Nos. 5,525,464; 5,695,940; 
5,834,189; 5,869,242; 5,876,934; 5,908,755; 5,912,118; 
5,952,174; 5,976,802; 5,981,186; 5,998,143; 6,004,744; 
6,017,702; 6,018,041; 6,025,136; 6,046,005; 6,087,095; 
6,117,634, 6,013,431, WO 98/30883; WO 98/56954; WO 
99/09218; WO/00/58519, and the others. 

[0027] As used herein, "polymorphism" refers to the coex- 
istence of more than one form of a gene or portion thereof. 


A portion of a gene of which there are at least two different 
forms, i.e., two different nucleotide sequences, is referred to 
as a "polymorphic region of a gene". A polymorphic region 
can be a single nucleotide, the identity of which differs in 
different alleles. A polymorphic region can also be several 
nucleotides in length. 

[0028] As used herein, "polymorphic gene" refers to a 
gene having at least one polymorphic region. 

[0029] As used herein, "allele", which is used interchange- 
ably herein with "allelic variant" refers to alternative forms 
of a gene or portions thereof. Alleles occupy the same locus 
or position on homologous chromosomes. When a subject 
has two identical alleles of a gene, the subject is said to be 
homozygous for the gene or allele. When a subject has two 
different alleles of a gene, the subject is said to be heterozy- 
gous for the gene. Alleles of a specific gene can differ from 
each other in a single nucleotide, or several nucleotides, and 
can include substitutions, deletions, and insertions of nucle- 
otides. An allele of a gene can also be a form of a gene 
containing a mutation. 

[0030] As used herein, the term "subject" refers to mam- 
mals and in particular human beings. 

[0031] As used herein, the term "gene" or "recombinant 
gene" refers to a nucleic acid molecule comprising an open 
reading frame and including at least one exon and (option- 
ally) at least one intron sequence. A gene can be either RNA 
or DNA. Genes may include regions preceding and follow- 
ing the coding region (leader and trailer). 

[0032] As used herein, "intron" refers to a DNA sequence 
present in a given gene which is spliced out during mRNA 
maturation. 

[0033] As used herein, the term "coding sequence" refers 
to that portion of a gene that encodes an amino acid 
sequence of a protein. 

[0034] As used herein, the term "sense strand" refers to 
that strand of a double-stranded nucleic acid molecule that 
encodes the sequence of the mRNA that encodes the amino 
acid sequence encoded by the double -stranded nucleic acid 
molecule. 

[0035] As used herein, the term "antisense strand" refers 
to that strand of a double -stranded nucleic acid molecule that 
is the complement of the sequence of the mRNA that 
encodes the amino acid sequence encoded by the double- 
stranded nucleic acid molecule. 

[0036] As used herein, a DNA or nucleic acid homolog 
refers to a nucleic acid that includes a preselected conserved 
nucleotide sequence. By the term "substantially homolo- 
gous" is meant having at least 80%, preferably at least 90%, 
most preferably at least 95% homology therewith or a less 
percentage of homology or identity and conserved biological 
activity or function. 

[0037] Regarding hybridization, as used herein, stringency 
conditions to achieve specific hybridization refer to the 
washing conditions for removing the non-specific probes or 
primers and conditions that are equivalent to either high, 
medium, or low stringency as described below: 
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1) high stringency: 0.1 x SSPE, 0.1% SDS, 65° C. 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50° C. 

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50° C. 


[0038] It is understood that equivalent stringencies may be 
achieved using alternative buffers, salts and temperatures. 

[0039] As used herein, "heterologous DNA" is DNA that 
encodes RNAand proteins that are not normally produced in 
vivo by the cell in which it is expressed or that mediates or 
encodes mediators that alter expression of endogenous DNA 
by affecting transcription, translation, or other regulatable 
biochemical processes or is not present in the exact orien- 
tation or position as the counterpart DNA in a wildtype cell. 
Heterologous DNA may also be referred to as foreign DNA. 
Any DNA that one of skill in the art would recognize or 
consider as heterologous or foreign to the cell in which is 
expressed is herein encompassed by heterologous DNA. 
Examples of heterologous DNA include, but are not limited 
to, DNA that encodes traceable marker proteins, such as a 
protein that confers drug resistance, DNA that encodes 
therapeutically effective substances, such as anti-cancer 
agents, enzymes and hormones, and DNA that encodes other 
types of proteins, such as antibodies. Antibodies that are 
encoded by heterologous DNA may be secreted or expressed 
on the surface of the cell in which the heterologous DNAhas 
been introduced. 

[0040] As used herein, a "promoter region" refers to the 
portion of DNA of a gene that controls transcription of the 
DNA to which it is operatively linked. The promoter region 
includes specific sequences of DNA that are sufficient for 
RNA polymerase recognition, binding and transcription 
initiation. This portion of the promoter region is referred to 
as the promoter. In addition, the promoter region includes 
sequences that modulate this recognition, binding and tran- 
scription initiation activity of the RNA polymerase. These 
sequences may be cis acting or may be responsive to trans 
acting factors. Promoters, depending upon the nature of the 
regulation, may be constitutive or regulated. 

[0041] As used herein, the phrase "operatively linked" 
generally means the sequences or segments have been 
covalently joined into one piece of DNA, whether in single 
or double stranded form, whereby control or regulatory 
sequences on one segment control or permit expression or 
replication or other such control of other segments. The two 
segments are not necessarily contiguous. For gene expres- 
sion a DNA sequence and a regulatory sequence(s) are 
connected in such a way to control or permit gene expression 
when the appropriate molecular, e.g., transcriptional activa- 
tor proteins, are bound to the regulatory sequencefs). 

[0042] As used herein, the term "vector"' refers to a nucleic 
acid molecule capable of transporting another nucleic acid to 
which it has been linked. One type of preferred vector is an 
episome, i.e., a nucleic acid capable of extra-chromosomal 
replication. Preferred vectors are those capable of autono- 
mous replication and/or expression of nucleic acids to which 
they are linked. Vectors capable of directing the expression 
of genes to which they are operatively linked are referred to 
herein as "expression vectors". In general, expression vec- 
tors of utility in recombinant DNA techniques are often in 


the form of "plasmids" which refer generally to circular 
double stranded DNA loops which, in their vector form are 
not bound to the chromosome. "Plasmid" and "vector" are 
used interchangeably as the plasmid is the most commonly 
used form of vector. Also included are other forms of 
expression vectors that serve equivalent functions and that 
become known in the art subsequently hereto. 

[0043] As used herein, "indicating" or "determining" 
means that the presence or absence of an allelic variant may 
be one of many factors that are considered when a subject's 
predisposition to a disease or disorder is evaluated. Thus a 
predisposition to a disease or disorder is not necessarily 
conclusively determined by only ascertaining the presence 
or absence of one or more allelic variants, but the presence 
of one of more of such variants is among an number of 
factors considered. 

[0044] As used herein, "predisposition to develop a dis- 
ease or disorder" means that a subject having a particular 
genotype and/or haplotype has a higher likelihood than one 
not having such a genotype and/or haplotype for developing 
a particular disease or disorder. 

[0045] As used herein, "transgenic animal" refers to any 
animal, preferably a non-human animal, e.g. a mammal, bird 
or an amphibian, in which one or more of the cells of the 
animal contain heterologous nucleic acid introduced by way 
of human intervention, such as by transgenic techniques 
well known in the art. The nucleic acid is introduced into the 
cell, directly or indirectly by introduction into a precursor of 
the cell, by way of deliberate genetic manipulation, such as 
by microinjection or by infection with a recombinant virus. 
The term genetic manipulation does not include classical 
cross-breeding, or in vitro fertilization, but rather is directed 
to the introduction of a recombinant DNA molecule. This 
molecule may be integrated within a chromosome, or it may 
be extrachromosomally replicating DNA. In the typical 
transgenic animals described herein, the transgene causes 
cells to express a recombinant form of a protein. However, 
transgenic animals in which the recombinant gene is silent 
are also contemplated, as for example, using the FLP or CRE 
recombinase dependent constructs. Moreover, "transgenic 
animal" also includes those recombinant animals in which 
gene disruption of one or more genes is caused by human 
intervention, including both recombination and antisense 
techniques. 

[0046] As used herein, "associated" refers to coincidence 
with the development or manifestation of a disease, condi- 
tion or phenotype. Association may be due to, but is not 
limited to, genes responsible for housekeeping functions, 
those that are part of a pathway that is involved in a specific 
disease, condition or phenotype and those that indirectly 
contribute to the manifestation of a disease, condition or 
phenotype. 

[0047] As used herein, "high serum cholesterol" refers to 
a level of serum cholesterol that is greater than that consid- 
ered to be in the normal range for a given age in a 
population, e.g., about 5.25 mmoles/L or greater, i.e., 
approximately one standard deviation or more away from 
the age-adjusted mean. 

[0048] As used herein, "low serum HDL" refers to a level 
of serum HDL that is less than that considered to be in the 
normal range for a given age in a population, e.g. about 1.11 
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mmoles/Lor less, i.e., approximately one standard deviation 
or more away from the age-adjusted mean. 

[0049] As used herein, "cardiovascular disease" refers to 
any manifestation of or predisposition to cardiovascular 
disease including, but not limited to, coronary artery disease 
and myocardial infarction. Included in predisposition is the 
manifestation of risks factors such as high serum cholesterol 
levels and low serum HDL levels. 

[0050] As used herein, "target nucleic acid" refers to a 
nucleic acid molecule which contains all or a portion of a 
polymorphic region of a gene of interest. 

[0051] As used herein, "signal moiety" refers to any 
moiety that allows for the detection of a nucleic acid 
molecule. Included are moieties covalently attached to 
nucleic acids and those that are not. 

[0052] As used herein, "biologically active agent that 
modulates serum cholesterol" refers to any drug, small 
molecule, nucleic acid (sense and antisense), protein, pep- 
tide, lipid, carbohydrate etc. or combination thereof, that 
exhibits some effect directly or indirectly on the cholesterol 
measured in a subject's serum. 

[0053] As used herein, "biologically active agent that 
modulates serum HDL" refers to any drug, small molecule, 
nucleic acid (sense and antisense), protein, peptide, lipid, 
carbohydrate etc. or combination thereof that exhibits some 
effect directly or indirectly on the HDL measured in a 
subject's serum. 

[0054] As used herein, "expression and/or activity" refers 
to the level of transcription or translation of the COX6B or 
GPI-1 gene, mRNA stability, protein stability or biological 
activity. 

[0055] As used herein, "cardiovascular drug" refers to a 
drug used to treat cardiovascular disease or a risk factor for 
the disease, either prophylactically or after a risk factor or 
disease condition has developed. Cardiovascular drugs 
include those drugs used to lower serum cholesterol and 
those used to alter the level of serum HDL. 

[0056] As used herein, "combining" refers to contacting 
the biologically active agent with a cell or animal such that 
the agent is introduced into the cell or animal. For a cell any 
method that results in an agent traversing the plasma mem- 
brane is useful. For an animal any of the standard routes of 
administration of an agent, e.g. oral, rectal, transmucosal, 
intestinal, intravenous, intraperitoneal, intraventricular, sub- 
cutaneous, intramuscular, etc., can be utilized. 

[0057] As used herein,"positive response" refers to 
improving or ameliorating at least one symptom or detect- 
able characteristic of a disease or condition, e.g., lowering 
serum cholesterol levels or raising serum HDL levels. 

[0058] As used herein, "biological sample" refers to any 
cell type or tissue of a subject from which nucleic acid, 
particularly DNA, can be obtained. 

[0059] As used herein, "array" refers to a collection of 
three or more items, such a collection of immobilized 
nucleic acid probes arranged on a solid substrate, such as 
silica, polymeric materials or glass. 

[0060] As used herein, a composition refers to any mix- 
ture. It may be a solution, a suspension, liquid, powder, a 
paste, aqueous, non-aqueous or any combination thereof. 


[0061] As used herein, a combination refers to any asso- 
ciation between two or among more items. 

[0062] As used herein, "kit" refers to a package that 
contains a combination, such as one or more primers or 
probes used to amplify or detect polymorphic regions of 
genes associated with cardiovascular disease, optionally 
including instructions and/or reagents for their use. 

[0063] As used herein "specifically hybridizes" refers to 
hybridization of a probe or primer only to a target sequence 
preferentially to a non- target sequence. Those of skill in the 
art are familiar with parameters that affect hybridization; 
such as temperature, probe or primer length and composi- 
tion, buffer composition and salt concentration and can 
readily adjust these parameters to achieve specific hybrid- 
ization of a nucleic acid to a target sequence. 

[0064] As used herein "nucleic acid" refers to polynucle- 
otides such as deoxyribonucleic acid (DNA) and ribonucleic 
acid (RNA). The term should also be understood to include, 
as equivalents, derivatives, variants and analogs of either 
RNA or DNA made from nucleotide analogs, single (sense 
or antisense) and double-stranded polynucleotides. Deox- 
yribonucleotides include deoxyadenosine, deoxycytidine, 
deoxyguanosine and deoxy thymidine. For RNA, the uracil 
base is uridine. 

[0065] As used herein, "mass spectrometry" encompasses 
any suitable mass spectrometric format known to those of 
skill in the art. Such formats include, but are not limited to, 
Matrix-Assisted Laser Desorption/Ionization, Time-of- 
Flight (MALDI-TOF), Electrospray (ES), IR-MALDI (see, 
e.g., published International PCT Application No. 99/57318 
and U.S. Pat. No. 5,118,937) Ion Cyclotron Resonance 
(ICR), Fourier Transform and combinations thereof. 
MALDI, particular UV and IR, are among the preferred 
formats. 

[0066] B. Cytochrome c Oxidase VIb Gene 

[0067] Cytochrome c oxidase (COX) is a mitochondrial 
enzyme complex integrated in the inner membrane. It trans- 
fers electrons from cytochrome to molecular oxygen in the 
terminal reaction of the respiratory chain in eukaryotic cells. 
COX contains of three large subunits encoded by the mito- 
chondrial genome and 10 other subunits, encoded by nuclear 
genes. The three subunits encoded by mitochondrial genome 
are responsible for the catalytic activity. The cytochrome c 
oxidase subunit VIb (COX6B) is one of the nuclear gene 
products. The function of the nuclear encoded subunits is 
unknown. One proposed role is in the regulation of catalytic 
activity; specifically the rate of electron transport and sto- 
ichiometry of proton pumping. Other proposed roles are not 
directly related to electron transport and include energy- 
dependent calcium uptake and protein import by the mito- 
chondrion. Proteolytic removal of subunits Via and VIb has 
been associated with loss of calcium transport in reconsti- 
tuted vesicles. Steady-state levels of the COX6B transcript 
are different in different tissues (Taanman et al., Gene 
(1990), 93:285). 

[0068] The COX6B gene is genericaUy used to include the 
human COX6B gene and its homologs from rat, mouse, 
guinea pig, etc. 

[0069] Several single nucleotide polymorphism have been 
identified in the human COX6B gene. One of these is 
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located at position 86 and is a C to T transversion which is 
manifested as a silent mutation in the coding region, ACC to 
ACT (threonine to threonine)(SEQ ID NO.: 2). Although 
this is a silent mutation at the amino acid level, it may 
represent an alteration that changes codon usage, or it may 
effect mRNA stability or it may be in linkage disequilibrium 
with a non-silent change. Other known single nucleotide 
polymorphisms of the COX6B gene include, but are not 
limited to, those listed in Table 1. 


TABLE 1 


Gene GenB a nk Accession No. 

SNP 

SNP Location 

COX6B NM_001863 

C/T 

86 

(SEQ ID NO.: 1) 

A/G 

6D 


A/r 

324 


A/r 

123 


[0070] Based on methods disclosed herein and those used 
in the art, one of skill would be able to utilize all the SNPs 
described and find additional polymorphic regions of the 
COX6B gene to determine whether allelic variants of these 
regions are associated with high cholesterol levels and 
cardiovascular disease. 

[0071] C. GPI-1 Gene 

[0072] Glycosylphosphatidylinositol (GPI) functions to 
anchor various eukaryotic proteins to membranes and is 
essential for their surface expression. Thus, a defect in GPI 
anchor synthesis affects various functions of cell, tissues and 
organs. Biosynthesis of glycosylphosphatidylinositol (GPI) 
is initiated by the transfer of N-acetylglucosamine (GIcNAc) 
from UDP-GIcNAc to phosphatidylinositol (PI) and is cata- 
lyzed by a GIcNAc transferase, GPI-GIcNAc transferase 
(GPI-GnT). Four mammalian gene products form a protein 
complex that is responsible for this enzyme activity (PIG-A, 
PIG-H, PIG-C and GPI-1). PIG-A, PIG-H, PIG-C are 
required for the first step in GPI anchor biosynthesis; GPI-1 
is not. Stabilization of the enzyme complex, rather than 
participation in GIcNAc transfer, has been suggested as a 
possible role for GPI-1 (Watanabe et al. EMBO (1998) 17: 
877). 

[0073] The GPI-1 gene is generically used to include the 
human GPI-1 gene and its homologs from rat, mouse, guinea 
pig, etc. 

[0074] A polymorphism has been identified at position 
2577 of the human GPI-1 gene. This is a G to A transversion. 
This SNP is located in the 3' untranslated region of the 
mRNA, and does not affect protein structure, but may affect 
mRNA stability or may be in linkage disequilibrium with a 
non-silent change. Other known single nucleotide polymor- 
phisms of the GPI-1 gene include, but are not limited to, 
those listed in Table 2. 

TABLE 2 


GenBank 

Gene Accession No. SNP SNP Location 


GPI-1 NM_004204 C/T 2829 

(SEQ ID NOS.: 6, 7) A/G 2577 

C/T 2519 

C/T 2289 


TABLE 2-continued 



GenBank 



Accession No. SNP 

SNP Location 


C/T 

193S 


C/G 

1563 


A/G/QT 

2664 


A/G 

2656 


A/QT 

2167 


G/C/A 

2166 


[0075] Based on methods disclosed herein and those used 
in the art, one of skill would be able to use all the described 
SNPs and find additional polymorphic regions of the GPI-1 
gene to determine whether allelic variants of these regions 
are associated with low levels of HDL and cardiovascular 
disease . 

[0076] D. Other Genes and Polymorphism Associated 
with Cardiovascular Disease 

[0077] Many other genes and polymorphisms contained 
within them have been associated with risks factors for 
cardiovascular disease (aberrations in lipid metabolism; 
specifically high levels of serum cholesterol and low levels 
of HDL, etc.) and/or the clinical phenotypes of atheroscle- 
rosis and cardiovascular disease. Table 3 presents a list of 
some of these genes and some associated polymorphisms 
(SNPs): cholesterol ester transfer protein, plasma (CETP); 
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO 
Al); apolipoprotein E (APO E); apolipoprotein B (APO B); 
apolipoprotein C-III (APO C3); a gene encoding lipoprotein 
lipase (LPL); ATP -binding cassette transporter (ABC 1); 
paraoxonase 1 (PON 1); paraoxonase 2 (PON 2); 5,10- 
methylenetetrahydrofolate r reductase (MTHFR); a gene 
encoding hepatic lipase (LIPC); E-se lectin; G protein beta 3 
subunit and angiotensin II type 1 receptor gene. The SNP 
locations are based on the GenBank sequence. Table 3 is not 
meant to be exhaustive, as one of skill in the art based on the 
disclosure would be able to readily use other known poly- 
morphisms in these and other genes, new polymorphisms 
discovered in previously identified genes and newly identi- 
fied genes and polymorphisms in the methods and compo- 
sitions disclosed herein. 


TABLE 3 




GenBank 


SNP 



Accession No. 

SNP 

Location 

CETP 


NM_000078 

C/A 

991 

(SEQ ID NOS.: 

11, 12) 


QT 

196 




A/G 

15S6 




A/G 

1394 




A/G 

1439 




C/G 

1297 




C/T 

766 




G/A 

1131 




G/A 

1696 

LPL 


NM_0Q0237 

A/G 

1127 

(SEQ ID NOS.: 

13, 14) 


A/C 

3447 




C/T 

1973 




QT 

3343 




G/A 

2851 




QT 

3272 




A/T 

242S 




T/C 

2743 




G/A 

1453 
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TABLE 3-contimied 


Gene 


GenBaiLk 

SNP 

SNP 

Location 




C/A 

3449 




G-/A 

1282 




G/A 

579 




A/C 

133S 




A/G/T/C 

2416-2426 




A/G 

2427 




C/T 

1302 




G/A 

609 




G/C 

1595 




G/A 

1309 




CAT 

2454 




C/T 

2988 




G/A 

280 




G/A 

1036 

APO A4 


NM_000482 

G/T 

1122 

(SEQ ID NOS.: 

15, 16) 


G/C 

1033 




G/A 

1002 




C/T 

960 




C/T 

894 




G/A 

554 




G/A 

950 




T/C 

336 




G/A 

334 




C/T 

330 




A/G 

201 




A/G 

16 




A/r 

1213 

APO E 


NM_000041 

C/T 

448 

(SEQ ID NOS.: 

17, IS) 


G/A 

44S 

(mRNA) 



C/T 

586 




C/T 

197 




C/T 

540 

Hepatic Lipase 


NM_000236 

C/G 

680 

(SEQ ID NOS.: 

19, 20) 


G/A 

1374 




G/A 

701 




C/A 

1492 




A/G 

648 




G/C 

729 




G/A 

340 




G/T 

522 

PON 1 


NM_000446 

A/r 

172 

(SEQ ID NOS.: 

21, 22) 


A/G 

5S4 




G/C 

190 

PON 2 


XM_004947 

C/G 

475 

(SEQ ID NOS.: 

23, 24) 


C/G 

964 

APO C3 


NM_00Q040 

CAT 

14S 

(SEQ ID NOS.: 

25, 26) 


T/A 

471 




G/C 

386 




G/T 

417 




T/A 

495 

ABC 1 


XM_005567 

G/A 

8591 

(SEQ ID NOS.: 

27, 28) 




APO Al 


NTVL_a00039 

C/G 

770 

(SEQ ID NOS.: 

29, 30) 


G/A 

656 




C/G 

589 




C/G 

414 




A/r 

430 




CAT 

708 




CAT 

221 




T/G 

223 




CAT 

597 




A/G 

340 




G/C 

690 

APO B 


NM_000384 

A/G/C/T 

13141 

(SEQ ID NOS.: 

31, 32) 


A/G/C/T 

12669 




CAT 

11323 




G/C 

10422 




A/C 

1040S 




C/G 

10083 




CAT 

7064 




CAT 

6666 




CAT 

1980 




C/G 

5751 


TABLE 3-contimied 


GenBank SNP 
Gene Accession No. SNP Location 


C/T 7673 

C/A/GAT 8344 

G/CAT/A 4393 

A/CAT/G 5894 

AAT 12019 

CAT 11973 

G/CAT/A 7065 

C/G 947 

C/G 7331 

A/G 7221 

G/C 6402 

G/C 3780 

C/G 1661 

A/T 8167 

C/A 8126 

CAT 421 

C/T 1981 

G/A 12510 

G/C 12937 

APO B (can't) G/A 11042 

C/T 2834 

A/G 5869 

A/G 11962 

C/G 4439 

G/A 7824 

G/A 13569 

G/A 9489 

G/A 2325 

G/A 10259 

C/G 14 

MTHFR NM_005957 G/A 5442 

(SEQ ID NOS.: 33, 34) A/G 5113 

A/G 5113 

A/G 5110 

A/G 5102 

A/CAT 5097 

A/CAT 5097 

CAT 5079 

C/T 5079 

T/C 5071 

T/C 5071 

T/C 5051 

G/A 5012 

C/A 5000 

A/G 49SS 

A/G 4994 

A/G 4994 

A/G 4994 

C/T 4991 

C/T 4991 

C/T 4991 

A/G 4986 

A/G 4986 

A/G 4986 

C/T 4985 

T/A 4982 

T/G 4981 

T/C 4981 

T/C 4981 

MTHFR (con't) G/C/A 4967 

G/A 4963 

A/G 4962 

G/CAT 4962 

A/C/GAT 4961 

A/C/T 4961 

A/C 4961 

A/C 4961 

A/CAT 4960 

T/C 4938 

T/C 4937 

T/C 4933 

G/CAT 4933 

C/T 4929 
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TABLE 3-contimied 


GenBank SNP 
Gene Accession No. SNP Location 


C/T 4929 

T/A/G 4929 

A/G 4928 

G/C 492S 

C/G 4927 

G/A 4923 

C/T 4919 

A/T/G 4913 

C/T 4912 

A/T 4903 

C/T 4902 

A/G 4900 

G/A 4S9S 

G/T 4S9S 

C/T 4S97 

GJT 4894 

T/C/G 4836 

C/T 3862 

C/T 4922 

C/T 4959 

T/C 4981 

A/G 4994 

AJG 5044 

T/C 5051 

G/C 5066 

C/T 5079 

MTHFR (con't) C/A/G 5085 

C/T 5092 

A/G 5103 

A/G 5113 

C/T 1021 

E-Selectin NM_000450 G/A 3484 

(SEQ ID NOS.: 35, 36) G/A 3093 

T/G 2939 

T/C 2902 

C/T 1937 

C/T 1916 

C/T 1839 

C/T 1805 


Position 99 1 <C/A) 
PCR primers: 

Forward: 

Reverse : 

MassEXTEHDTM Primer: 
Primer Mass: 

Extended Primer-Allele C: 
Extended Primer Mass: 
Extended Primer-Allele A: 
Extended Primer Mass: 
Position 196 (CIT) 
PCR primers: 
Forward: 
Reverse : 

Mass EXTEND ™ Primer: 


TABLE 3-contimied 


GenBank SNP 
Gene Accession No. SNP Location 



C/T 

151S 





C/T 

1376 


G/A 

999 


T/C 

857 


A/C 

561 


C/G 

506 


A/G 

392 


G/T 

98 

G protein (53 subunit NM_0Q2075 

C/T 

1S2S 

(SEQ ID NOS.: 37, 38) 

CT 

1546 


G/T 

1431 


G/A 

1231 


C/T 

1230 

Angiotensin II type 1 NM_00686 

G/A 

1453 

receptor gene 

C/G 

96S 

(SEQ ID NOS.: 39, 40) 

G/C 

966 


T/C 

941 


G/A 

894 


T/C 

659 


[0078] Assays to identify the nucleotide present at the 
polymorphic site include those described herein and all 
others known to those who practice the art. 

[0079] For some of the SNPs described above, there are 
provided a description of the MassEXTEND™ reaction 
components that can be utilized to determine the allelic 
variant that is present. Included are the forward and reverse 
primers used for amplification. Also included are the Mas- 
sEXTEND™ primer used in the primer extension reaction 
and the extended MassEXTEND™ primers for each allele. 
MassEXTEND™ reactions are carried out and the products 
analyzed as described in Examples 2 and 3. 

[0080] CETP 


ACT GC C T GAT AACC AT GC TG (SEQ ID NO.: 41) 

ATACT T ACAC AC C AGG AGGG- ( SEQ ID NO . : 42) 

ATGCCTGCTCCAAAGGCAC (SEQ ID HO . : 43 > 
5757. B 

ATGCC T G CTC CAAAGG CACC (SEQ ID HO . : 44) 
6030.9 

ATGCC T G CTC CAAAGG CACAT (SEQ ID HO . : 45) 
6359.2 

TACTTCTGGTTCTCTGAGCG ( SEQ ID NO . : 46) 

ACT C AC C TTGAACT C G TC TC (SEQ ID HO.: 47) 

TGGTTCTCTGAGCGAGTCTT (SEQ ID HO . : 48) 
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Primer Mass: 

Extended Prime r-All el e C: 
Extended Primer Mass: 
Extended Primer— All el e T: 
Extended Primer Mass: 
Position 1586 (AIG) 
POR primers: 
Forward: 
Reverse : 

MassEXTENDTM Primer: 
Primer Mass: 

Extended Primer— Allele G: 
Extended Primer Mass: 
Extended Primer-Allele A: 
Extended Primer Mass; 
APOA4 

Position 1122 (GIT) 
POR primers: 
Forward: 
Reverse : 

Mass EXTEND *" Primer: 
Primer Mass: 

Extended Primer-Allele G: 
Extended Primer Mass: 
Extended Primer-Allele T : 
Extended Primer Mass: 
Position 1033 (GIC) 
PCR primers: 
Forward: 

MassEXTEND M Primer: 
Primer Mass: 

Extended Primer-Allele G: 
Extended Primer Mass: 
Extended Primer-Allele C: 
Extended Primer Mass: 
Position 1002 (G/A) 


■continued 

6130 

TGGTTCTCTGAGCGAGTCTTC ( SEQ ID HO . : 49) 
6707.4 

TGGTTCTCTGAGCGAGTCTTTC ( SEQ ID NO . : 50) 
6333. 1 


TGCAGATGGACTTTGGCTTC 
TGCTTGCCTTCTGCTACAAG 
CTTCCCTGAGCACCTGCTG 
5715.7 

CTTCCCTGAGCACCTGCTGGT 
6333. 1 

CTTCCCTGAGCACCTGCTGA 
601 2.9 


( SEQ ID NO . : 5 1) 

( SEQ ID NO . : 52 ) 

(SEQ ID NO. : 53) 

( SEQ ID NO . : 54 ) 

(SEQ ID NO. : 55) 


AACAGCTCAGGACGAAACTG 
AGAAGGAGTTGACCTTGTCC 
GGAAGCTCAAGTGGCCTTC 
5328. 8 

GGAAGCTCAAGTGGCCTTCC 
6102.0 

GGAAGCTCAAGTGGCCTTC AAC 
6728.4 


(SEQ ID NO. : 56) 

( SEQ ID NO . : 57 ) 

( SEQ ID NO . : 5)8) 

(SEQ ID NO. : 59) 

( SEQ ID NO . : 60 ) 


AAGTCACTGGCAGAGCTGG 
GC AC C AGGGCT TTGT TGAAG 
TTTTCCCCGTAGGGCTCCA 
5730.7 

TTTTCCCCGTAGGGCTCCAC 
6003.9 

TTTTCCCCGTAGGGCTCCAGC 
6333. 1 


( SEQ ID NO . : 6 1) 

( SEQ ID NO . : 62 ) 

(SEQ ID NO. : 63) 

( SEQ ID NO . : 64 ) 

( SEQ ID NO . : 65 ) 
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PGR primers: 
Forward: 
Reverse : 

Mass EXTEND " Primer: 
Primer Mass: 

Extended Prime r-All el e G: 
Extended Primer Mass: 
Extended Primer-Allele A: 
Extended Primer Mass: 
Position 9 60 (CIT) 
PCR primers: 
Forward: 
Reverse : 

MassEXTEND '" Primer: 
Primer Mass: 

Extended Primer-Allele T : 
Extended Primer Mass: 
Extended Primer-Allele C: 
Extended Primer Mass: 
Position 8 94 (CIT) 
PCR primers: 
Forward: 
Reverse : 

MassEXTEMD " Primer: 
Primer Mass: 

Extended Primer-Allele C: 
Extended Primer Mass: 
Extended Primer-Allele T: 
Extended Primer Mass: 
Position 554 (G/A) 
PCR primers: 
Forward: 
Reverse : 

MassEXTEMD " Primer: 
Primer Mass: 

Extended Primer-Allele A: 
Extended Primer Mass: 


■continued 

TGCAGAAGTCACTGGCAGAG ( SEQ ID HO . : 66) 

GTTGAAGTTTTCCCCGTAGG (SEQ ID NO . : 67) 

ACTCCTCCACCTGCTGGTC ( SEQ ID NO . : 68) 
5675.7 

ACTCCTCCACCTGCTGGTCC (SEQ ID NO.: 69) 
5948.9 

ACTCCTCCACCTGCTGGTCTA ( SEQ ID NO . : 70) 
6277. 1 


AGGACGTGCGTGGCAACCTG ( SEQ ID NO . : 7 1) 

AGCTGTGCCAGTGACTTCTG (SEQ ID NO . : 72) 

GTGACTTCTGCAGCCCCTC ( SEQ ID NO . : 73) 

571 5.7 

GTGACTTCTGCAGCCCCTC A ( SEQ ID NO . : 74) 

601 2.9 

GTGACTTCTGGAGCCCCTCGGT (SEQ ID NO . : 75) 
6662.3 


CCTGACCTTCCAGATGAAG 
TCAGGTTGCCACGCACGTC 
CAGGATCTCGGCCAGTGC 
5500.6 

CAGGATCTCGGCCAGTGCC 
5773.8 

CAGGATCTCGGCCAGTGCTG 
61 18.0 


(SEQ ID NO. : 76) 

(SEQ ID NO. : 77) 

( SEQ ID HO . : 78 ) 

( SEQ ID NO . : 79 ) 

( SEQ ID NO. : 80) 


AC CT G CGAGAGCTT C AGC AG ( SEQ ID NO . : 81) 

TCTCCATGCGCTGTGCGTAG ( SEQ ID NO.: 82) 

AGCTGCGCACCCAGGTCA ( SEQ ID NO . : 83) 
5469.6 

AGCTGCGCACCCAGGTCAA (SEQ ID NO . : 84) 
5766.8 


US 2003/0224418 Al 


11 


Dec. 4, 2003 


Extended Primer-Allele G: 
Extended Primer Mass; 
APOE 

Position 4 48 (CIT) 
PCR primers: 
Forward: 

MassEXTEND " Primer: 
Primer Mass: 

Extended Primer-Allele C: 
Extended Primer Mass: 
Extended Primer— Allele T: 
Extended Primer Mass: 
LPL 

Position 1127 (A/G) 
PCR primers: 
Forward: 
Reverse : 

MassEXTEND " Primer: 
Primer Mass: 

Extended Primer-Allele A: 
Extended Primer Mass: 
Extended Primer-Allele G: 
Extended Primer Mass: 
Position 3447 (A/C) 
PCR primers: 
Forward: 
Reverse : 

MassEXTEHDTM Primer: 
Primer Mass: 

Extended Primer-Allele A: 
Extended Primer Mass: 
Extended Primer-Allele C: 
Extended Primer Mass: 
Position 1973 (C/TI 
PCR primers: 
Forward: 
Reverse : 


■continued 

AGCTGCGCACCCAGGTCAGC ( SEQ ID HO.: 85) 

6072.0 


TGTC C AAGGAGCTGC AGGC (SEQ ID NO.: 86) 

CTTACGCAGCTTGCGCAGGT ( SEQ ID WO . : 87) 

GCGGAGATGGAGGACGTG (SEQ ID HO . : 88) 

5S29.7 

GCGGACATGGAGGACGTGC (SEQ ID HO . : 89) 

5902.8 

GCGGACATGGAGGACGTGTG ( SEQ ID HO . : 90) 

6247. 1 


GTTGTAGAAAGAACCGCTGC ( SEg XD HO . : 9 1) 

GAGAACGAGTCTTCAGGTAC (SEQ ID HO . : 92) 

ACAATCTGGGCTATGAGATCA (SEQ ID HO . : 93) 

6454.2 

ACAATCTGGGCTATGAGATCAA (SEQ ID HO . : 94) 
6751 .4 

AC AAT CT GGGC TAT G AGATC AGT (SEQ ID HO . : 95) 
7071 .6 


GACTCTACACTGCATGTCTC 
ACCCTTCTGAAAAGGAGAGG 
GAGGAGAGACAAGGCAGATA 
6273. 1 

GAGGAGAGACAAGGCAGATAT 
6561.3 

GAGGAGAGACAAGGCAGATAGT 
6890.5 


( SEQ ID HO. : 96) 

( SEQ ID HO . : 97 ) 

(SEQ ID HO. : 98) 

( SEQ ID HO . : 99 ) 

(SEQ ID HO. : 100 ) 


AAAGGTTCAGTTGCTGCTGC (SEQ ID HO . : 101) 

GCTGGGGAAGGTCTAATAAC (SEQ ID HO . : 102) 
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MassEXTENDTM Primer: 
Primer Mass: 

Extended Primer-Allele C: 
Extended Primer Mass: 
Extended Primer-Allele T: 
Extended Primer Mass: 
LIPC 

Position 6 80 (CIG) 
PGR primers: 
Forward: 
Reverse: 

MassEXTEMD ™ Primer: 
Primer Mass: 

Extended Primer-Allele C: 
Extended Primer Mass: 
Extended Primer-Allele G: 
Extended Primer Mass: 
Position 137 4 (GIA) 
PCR primers: 
Forward: 
Reverse: 

MassEXTEMD " Primer: 
Primer Mass: 

Extended Primer-Allele A: 
Extended Primer Mass: 
Extended Primer-Allele G: 
Extended Primer Mass: 
Position 701 ( G/A) 
PGR primers: 
Forward: 
Reverse : 

MassEXTEMD ™ Primer: 
Primer Mass: 

Extended Primer-Allele A: 
Extended Primer Mass: 
Extended Primer-Allele G: 
Extended Primer Mass: 


-continued 

GTTGCTGCTGCCTCGAATG ( SEQ ID HO.: 103) 

5770.7 

GTTGCTGCTGCCTCGAATCC (SEQ ID NO . : 104) 

6043.9 

GTTGCTGCTGCCTCGAATCTG (SEQ ID HO . : 105) 

6388.2 


CGTCTTTCTCCAGATGATGC 
AGTGTCCTATGGGCTGTTTG 
GGATGCCATTCATACCTTTAC 
6556. 1 

GGATGCCATTCATACCTTTACC 
6629.3 

GGATGCCATTCATACCTTTAC GC (SEQ ID HO . : 110) 
6958.5 


(SEQ ID HO. : 106 ) 

(SEQ ID HO. : 107 ) 

( SEQ ID HO . : 108 ) 

( SEQ ID HO . : 109 ) 


TGGGAAAACAGTGCAGTGTG 
TGATCGTCTTCAGAACGAGG 
CCAGACCATCATCCCATGGA 
6030.9 

CCAGACCATCATCCCATGGAA 
6328. 1 

CCAGACCATCATCCCATGGAGC 
6633.3 


( SEQ ID HO . : 111) 

(SEQ ID HO. : 1 12 ) 

( SEQ ID HO . : 113) 

(SEQ ID HO. : 114 ) 

( SEQ ID HO . : 1 15 ) 


CAGCAATCGTCTTTCTCCAG 
TCCTATGGGCTGTTTGATGC 
GTCTTTCTCCAGATGATGCCA 
6372.2 

GTCTTTCTCCAGATGATGCCAA 
6669.4 

GTCTTTCTCCAGATGATGCCAGT (SEQ ID HO.: 120) 
6939. 6 


( SEQ ID HO . : 116) 

( SEQ ID HO . : 117) 

(SEQ ID HO. : 118 ) 

( SEQ ID HO. : 119 ) 
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[0081] E. Databases 

[0082] Databases for determining an association between 
polymorphic regions of genes and intermediate and clinical 
phenotypes, comprise biological samples (e.g., blood) which 
provide a source of nucleic acid and clinical data covering 
diseases (e.g., age, sex, ethnicity medical history and family 
medical history) from both individuals exhibiting the phe- 
notype (intermediate phenotype (risk factor) or clinical 
phenotype (disease)) and those who do not. These databases 
include human population groups such as twins, diverse 
affected families, isolated founder populations and drug trial 
subjects. The quality and consistency of the clinical 
resources are of primary importance. 

[0083] F. Association Studies 

[0084] The examples set forth below utilized an extreme 
trait analysis to discover an association between an allelic 
variant of the COX6B gene and high cholesterol and an 
association between an allelic variant of the GPI-1 gene and 
low HDL. This analysis is based on comparing a pair of 
pools of DNA from individuals who exhibit respectively 
hypo or hypernormal levels of a biochemical trait (e.g., 
cholesterol or HDL) and individually examining SNPs for a 
difference in allelic frequency between the pools. An asso- 
ciation is considered to be positive if a statistically signifi- 
cant value of at least 3.841 using a 1-degree-of-freedom 
chi-squared test of association, p=0.05, is obtained. Standard 
multiple testing corrections are applied if more than one 
SNP is considered at a time, i.e., multiple SNPs are tested 
during the same study. Although not always required, it may 
be necessary to further examine the frequency of allelic 
variants in other populations, including those exhibiting 
normal levels of the given trait. 

[0085] For a qualitative trait (e.g., hypertension) associa- 
tion studies are based on determining the occurrence of 
certain alleles in a given population of diseased vs. healthy 
individuals. 

[0086] Allelic variants of COX6B, GPI-1 and other genes 
found to associate with high cholesterol, low HDL and/or 
cardiovascular disease can represent useful markers for 
indicating a predisposition for developing a risk factor for 
cardiovascular disease. These allelic variants may not nec- 
essarily represent functional variants affecting the expres- 
sion, stability, or activity of the encoded protein product. 
Those of skill in the art would be able to determine which 
allelic variants are to be used, alone or in conjunction with 
other variants, only for indicating a predisposition for car- 
diovascular disease or for profiling of drug reactivity and for 
determining those which may be also useful for screening 
for potential therapeutics. 

[0087] Any method used to determine association can be 
utilized to discover or confirm the association of other 
polymorphic regions in the COX6B gene, the GPI-1 gene or 
any other gene that may be associated with cardiovascular 
disease. 

[0088] G. Detection of Polymorphisms 

[0089] 1. Nucleic Acid Detection Method 

[0090] Generally, these methods are based in sequence- 
specific polynucleotides, oligonucleotides, probes and prim- 
ers. Any method known to those of skill in the art for 
detecting a specific nucleotide within a nucleic acid 


sequence or for determining the identity of a specific nucle- 
otide in a nucleic acid sequence is applicable to the methods 
of determining the presence or absence of an allelic variant 
of a COX6B gene or GPI-1 gene or another gene associated 
with cardiovascular disease. Such methods include, but are 
not limited to, techniques utilizing nucleic acid hybridiza- 
tion of sequence-specific probes, nucleic acid sequencing, 
selective amplification, analysis of restriction enzyme 
digests of the nucleic acid, cleavage of mismatched hetero- 
duplexes of nucleic acid and probe, alterations of electro- 
phoretic mobility, primer specific extension, oligonucleotide 
ligation assay and single -stranded conformation polymor- 
phism analysis. In particular, primer extension reactions that 
specifically terminate by incorporating a dideoxynucleotide 
are useful for detection. Several such general nucleic acid 
detection assays are described in U.S. Pat. No. 6,030,778. 

[0091] a. Primer Extension -Based Methods 

[0092] Several primer extension -based methods for deter- 
mining the identity of a particular nucleotide in a nucleic 
acid sequence have been reported (see, e.g., PCT Applica- 
tion No. PCT/US 9 6/03651 (W096/29431), PCT Applica- 
tion No. PCT/US97/20444 (WO 98/20019), PCT Applica- 
tion No. PCT/US91/00046 (WO91/13075), and U.S. Pat. 
No. 5,856,092). In general, a primer is prepared that spe- 
cifically hybridizes adjacent to a polymorphic site in a 
particular nucleic acid sequence. The primer is then 
extended in the presence of one or more dideoxynucleotides, 
typically with at least one of the dideoxynucleotides being 
the complement of the nucleotide that is polymorphic at the 
site. The primer and/or the dideoxynucleotides may be 
labeled to facilitate a determination of primer extension and 
identity of the extended nucleotide. 

[0093] In a preferred method, primer extension and/or the 
identity of the extended nucleotide(s) are determined by 
mass spectrometry (see, e.g., PCT Application Nos. PCT/ 
US96/03651 (W096/29431) and PCT/US97/20444 (WO 
98/20019)). 

[0094] b. Polymorphism-Specific Probe Hybridization 

[0095] A preferred detection method is allele specific 
hybridization using probes overlapping the polymorphic site 
and having about 5, 10, 15, 20, 25, or 30 nucleotides around 
the polymorphic region. The probes can contain naturally 
occurring or modified nucleotides (see U.S. Pat. No. 6,156, 
501). For example, oligonucleotide probes may be prepared 
in which the known polymorphic nucleotide is placed cen- 
trally (allele-specific probes) and then hybridized to target 
DNA under conditions which permit hybridization only if a 
perfect match is found (Saiki et al. (1986) Nature 324: 163; 
Saiki et al. (1989) Proc. Natl Acad. Sci USA 86: 6230; and 
Wallace et al. (1979) Nucl. Acids Res. 6: 3543). Such allele 
specific oligonucleotide hybridization techniques may be 
used for the simultaneous detection of several nucleotide 
changes in different polymorphic regions. For example, 
oligonucleotides having nucleotide sequences of specific 
allelic variants are attached to a hybridizing membrane and 
this membrane is then hybridized with labeled sample 
nucleic acid. Analysis of the hybridization signal will then 
reveal the identity of the nucleotides of the sample nucleic 
acid. In a preferred embodiment, several probes capable of 
hybridizing specifically to allelic variants are attached to a 
solid phase support, e.g., a "chip". Oligonucleotides can be 
bound to a solid support by a variety of processes, including 
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lithography. For example a chip can hold up to 250,000 
oligonucleotides (GeneChip, Affymetrix, Santa Clara, 
Calif.). Mutation detection analysis using these chips com- 
prising oligonucleotides, also termed "DNA probe arrays" is 
described e.g., in Cronin et al. (1996) Human Mutation 7: 
244 and in Kozal et al. (1996) Nature Medicine 2: 753. In 
one embodiment, a chip includes all the allelic variants of at 
least one polymorphic region of a gene. The solid phase 
support is then contacted with a test nucleic acid and 
hybridization to the specific probes is detected. Accordingly, 
the identity of numerous allelic variants of one or more 
genes can be identified in a simple hybridization experiment. 

[0096] C. Nucleic Acid Amplification-Based Methods 

[0097] In other detection methods, it is necessary to first 
amplify at least a portion of a COX6B gene, GPI-1 gene or 
another gene associated with cardiovascular disease prior to 
identifying the allelic variant. Amplification can be per- 
formed, e.g., by PCR and/or LCR, according to methods 
known in the art. In one embodiment, genomic DNA of a 
cell is exposed to two PCR primers and amplification is 
performed for a number of cycles sufficient to produce the 
required amount of amplified DNA. In preferred embodi- 
ments, the primers are located between 1 50 and 350 base 
pairs apart. 

[0098] Alternative amplification methods include: self 
sustained sequence replication (Guatelli, J. C. et al. (1990) 
Proc. Natl. Acad. Sci. U.S.A. 87: 1874-1878); transcrip- 
tional amplification system (Kwoh, D. Y. et al. (1989) Proc. 
Natl. Acad. Sci. U.S.A. 86: 1173-1177), Q-Beta Replicase 
(Lizardi, P. M. et al. (1988) Bio/Technology 6: 1197), or any 
other nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques well 
known to those of skill in the art. These detection schemes 
are especially useful for the detection of nucleic acid mol- 
ecules if such molecules are present in very low numbers. 

[0099] Alternatively, allele specific amplification technol- 
ogy, which depends on selective PCR amplification may be 
used in conjunction with the alleles provided herein. Oligo- 
nucleotides used as primers for specific amplification may 
carry the allelic variant of interest in the center of the 
molecule (so that amplification depends on differential 
hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 
17:2437-2448) or at the extreme 3 1 end of one primer where, 
under appropriate conditions, mismatch can prevent, or 
reduce polymerase extension (Prossner (1993) Tibtech 
11:238; Newton et al. (1989) Nucl. Acids Res. 17:2503). In 
addition it may be desirable to introduce a restriction site in 
the region of the mutation to create cleavage -based detection 
(Gasparini et al. (1992) Mol. Cell Probes 6:1). 

[0100] d. Nucleic Acid Sequencing-Based Methods 

[0101] In one embodiment, any of a variety of sequencing 
reactions known in the art can be used to directly sequence 
at least a portion of the COX6B gene, GPI-1 gene or other 
gene associated with cardiovascular disease and to detect 
allelic variants, e.g., mutations, by comparing the sequence 
of the sample sequence with the corresponding wild-type 
(control) sequence. Exemplary sequencing reactions include 
those based on techniques developed by Maxam and Gilbert 
(Proc. Natl. Acad. Sci. USA (1977) 74:560) or Sanger 
(Sanger et al. (1977) Proc. Natl. Acad. Sci 74:5463). It is 
also contemplated that any of a variety of automated 


sequencing procedures may be used when performing the 
subject assays (Biotechniques (1995) 19:448), including 
sequencing by mass spectrometry (see, for example, U.S. 
Pat. No. 5,547,835 and International PCT Application No. 
WO 94/16101, entitled DNA Sequencing by Mass Spec- 
trometry by H. Koster; U.S. Pat. No. 5,547,835 and Inter- 
national PCT Application No. WO 94/21822, entitled "DNA 
Sequencing by Mass Spectrometry Via Exonuclease Degra- 
dation" by H. Koster), and U.S. Pat. No. 5,605,798 and 
International Patent Application No. PCT/US96/03651 
entitled DNA Diagnostics Based on Mass Spectrometry by 
H. Koster; Cohen et al. (1996) Adv Chromatogr 36:127-162; 
and Griffin et al. (1993) Appl Biochem Biotechnol 38:147- 
159). It will be evident to one skilled in the art that, for 
certain embodiments, the occurrence of only one, two or 
three of the nucleic acid bases need be determined in the 
sequencing reaction. For instance, A-track sequencing or an 
equivalent, e.g., where only one nucleotide is detected, can 
be carried out. Other sequencing methods are disclosed, e.g., 
in U.S. Pat. No. 5,580,732 entitled "Method of DNA 
sequencing employing a mixed DNA-polymer chain probe" 
and U.S. Pat. No. 5,571,676 entitled "Method for mismatch- 
directed in vitro DNA sequencing". 

[0102] e. Restriction Enzyme Digest Analysis 

[0103] In some cases, the presence of a specific allele in 
nucleic acid, particularly DNA, from a subject can be shown 
by restriction enzyme analysis. For example, a specific 
nucleotide polymorphism can result in a nucleotide 
sequence containing a restriction site which is absent from 
the nucleotide sequence of another allelic variant. 

[0104] f. Mismatch Cleavage 

[0105] Protection from cleavage agents, such as, but not 
limited to, a nuclease, hydroxylamine or osmium tetroxide 
and with piperidine, can be used to detect mismatched bases 
in RNA/RNA DNA/DNA, or RNA/DNA heteroduplexes 
(Myers, et al. (1985) Science 230:1242). In general, the 
technique of "mismatch cleavage" starts by providing het- 
eroduplexes formed by hybridizing a control nucleic acid, 
which is optionally labeled, e.g., RNAor DNA, comprising 
a nucleotide sequence of an allelic variant with a sample 
nucleic acid, e.g, RNA or DNA, obtained from a tissue 
sample. The double -stranded duplexes are treated with an 
agent, which cleaves single-stranded regions of the duplex 
such as duplexes formed based on basepair mismatches 
between the control and sample strands. For instance, RNA/ 
DNA duplexes can be treated with RNase and DNA/DNA 
hybrids treated with SI nuclease to enzymatically digest the 
mismatched regions. 

[0106] In other embodiments, either DNA/DNA or RNA/ 
DNA duplexes can be treated with hydroxylamine or 
osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size on 
denaturing polyacrylamide gels to determine whether the 
control and sample nucleic acids have an identical nucle- 
otide sequence or in which nucleotides they differ (see, for 
example, Cotton et al. (1988) Proc. Natl Acad Sci USA 85: 
4397; Saleeba et al. (1992) Methods Enzymol. 217: 286- 
295). The control or sample nucleic acid is labeled for 
detection. 
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[0107] g. Electrophoretic Mobility Alterations 
[0108] In other embodiments, alteration in electrophoretic 
mobility is used to identify the type of allelic variant in the 
COX6B gene, GPI-1 gene or other gene associated with 
cardiovascular disease. For example, single-strand confor- 
mation polymorphism (SSCP) may be used to detect differ- 
ences in electrophoretic mobility between mutant and wild 
type nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:2766, see also Cotton (1993) Mutat Res 285:125- 
144; and Hayashi (1992) Genet Anal Tech Appl 9:73-79). 
Single-stranded DNA fragments of sample and control 
nucleic acids are denatured and allowed to renature. The 
secondary structure of single -stranded nucleic acids varies 
according to sequence, the resulting alteration in electro- 
phoretic mobility enables the detection of even a single base 
change. The DNA fragments may be labeled or detected 
with labeled probes. The sensitivity of the assay may be 
enhanced by using RNA (rather than DNA), in which the 
secondary structure is more sensitive to a change in 
sequence. In another preferred embodiment, the subject 
method utilizes heteroduplex analysis to separate double 
stranded heteroduplex molecules on the basis of changes in 
electrophoretic mobility (Keen et al. (1991) Trends Genet 
7:5). 

[0109] h. Polyacrylamide Gel Electrophoresis 
[0110] In yet another embodiment, the identity of an 
allelic variant of a polymorphic region in the COX6B gene, 
GPI-1 gene or other gene associated with cardiovascular 
disease is obtained by analyzing the movement of a nucleic 
acid comprising the polymorphic region in polyacrylamide 
gels containing a gradient of denaturant is assayed using 
denaturing gradient gel electrophoresis (DGGE) (Myers et 
al. (1985) Nature 313:495). When DGGE is used as the 
method of analysis, DNA will be modified to ensure that it 
does not completely denature, for example by adding a GC 
clamp of approximately 40 bp of high-melting GC-rich 
DNA by PCR. In a further embodiment, a temperature 
gradient is used in place of a denaturing agent gradient to 
identify differences in the mobility of control and sample 
DNA (Rosenbaum and Reissner (1987) Biophys Chem 
265:1275). 

[0111] i. Oligonucleotide Ligation Assay (OLA) 
[0112] In another embodiment, identification of the allelic 
variant is carried out using an oligonucleotide ligation assay 
(OLA), as described, e.g., in U.S. Pat. No. 4,998,617 and in 
Landegren, U. et al., Science 241:1077-1080 (1988). The 
OLA protocol uses two oligonucleotides which are designed 
to be capable of hybridizing to abutting sequences of a single 
strand of a target. One of the oligonucleotides is linked to a 
separation marker, e.g,. biotinylated, and the other is detect- 
ably labeled. If the precise complementary sequence is 
found in a target molecule, the oligonucleotides will hybrid- 
ize such that their termini abut, and create a ligation sub- 
strate. Ligation then permits the labeled oligonucleotide to 
be recovered using avidin, or another biotin ligand. Nick- 
erson, D. A. et al. have described a nucleic acid detection 
assay that combines attributes of PCR and OLA(Nickerson, 
D. A. et al., Proc. Natl. Acad. ScL (U.S.A.) 87:8923-8927 
(1990). In this method, PCR is used to achieve the expo- 
nential amplification of target DNA, which is then detected 
using OLA. 

[0113] Several techniques based on this OLA method have 
been developed and can be used to detect specific allelic 


variants of a polymorphic region of a gene. For example, 
U.S. Pat. No. 5,593,826 discloses an OLA using an oligo- 
nucleotide having 3' -amino group and a 5'-phosphorylated 
oligonucleotide to form a conjugate having a phosphorami- 
date linkage. In another variation of OLA described in Tobe 
et al. (1996) Nucl. Acids Res. 24: 3728), OLA combined 
with PCR permits typing of two alleles in a single microliter 
well. By marking each of the allele-specific primers with a 
unique hapten, i.e. digoxigenin and fluorescein, each OLA 
reaction can be detected by using hapten specific antibodies 
that are labeled with different enzyme reporters, alkaline 
phosphatase or horseradish peroxidase. This system permits 
the detection of the two alleles using a high throughput 
format that leads to the production of two different colors. 

[0114] j. SNP Detection Methods 

[0115] Also provided are methods for detecting single 
nucleotide polymorphisms. Because single nucleotide poly- 
morphisms constitute sites of variation flanked by regions of 
invariant sequence, their analysis requires no more than the 
determination of the identity of the single nucleotide present 
at the site of variation and it is unnecessary to determine a 
complete gene sequence for each patient. Several methods 
have been developed to facilitate the analysis of such single 
nucleotide polymorphisms. 

[0116] In one embodiment, the single base polymorphism 
can be detected by using a specialized exonuclease-resistant 
nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S. Pat. No. 
4,656,127). According to the method, a primer complemen- 
tary to the allelic sequence immediately 3' to the polymor- 
phic site is permitted to hybridize to a target molecule 
obtained from a particular animal or human. If the polymor- 
phic site on the target molecule contains a nucleotide that is 
complementary to the particular exonuclease-resistant 
nucleotide derivative present, then that derivative will be 
incorporated onto the end of the hybridized primer. Such 
incorporation renders the primer resistant to exonuclease, 
and thereby permits its detection. Since the identity of the 
exonuclease-resistant derivative of the sample is known, a 
finding that the primer has become resistant to exonucleases 
reveals that the nucleotide present in the polymorphic site of 
the target molecule was complementary to that of the 
nucleotide derivative used in the reaction. This method has 
the advantage that it does not require the determination of 
large amounts of extraneous sequence data. 

[0117] In another embodiment, a solution-based method 
for determining the identity of the nucleotide of a polymor- 
phic site is employed (Cohen, D. et al. (French Patent 
2,650,840; PCT Application No. WO91/02087)). As in the 
Mundy method of U.S. Pat. No. 4,656,127, a primer is 
employed that is complementary to allelic sequences imme- 
diately 3' to a polymorphic site. The method determines the 
identity of the nucleotide of that site using labeled dideoxy- 
nucleotide derivatives, which, if complementary to the 
nucleotide of the polymorphic site will become incorporated 
onto the terminus of the primer. 

[0118] k. Genetic Bit Analysis 

[0119] An alternative method, known as Genetic Bit 
Analysis or GBA™ is described by Goelet, et al. (U.S. Pat. 
No. 6,004,744, PCT Application No. 92/15712). The method 
of Goelet, et al. uses mixtures of labeled terminators and a 
primer that is complementary to the sequence 3' to a 
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polymorphic site. The labeled terminator that is incorporated 
is thus determined by, and complementary to, the nucleotide 
present in the polymorphic site of the target molecule being 
evaluated. In contrast to the method of Cohen et al. (French 
Patent 2,650,840; PCT Application No. WO 9 1/02087), the 
method of Goelet, et al. is preferably a heterogeneous phase 
assay, in which the primer or the target molecule is immo- 
bilized to a solid phase. 

[0120] I. Other Primer-Guided Nucleotide Incorporation 
Procedures 

[0121] Other primer-guided nucleotide incorporation pro- 
cedures for assaying polymorphic sites in DNA have been 
described (Komher, J. S. et al., Nucl. Acids Res. 17:7779- 
7784 (1989); Sokolov, B. P., Nucl. Acids Res. 18:3671 
(1990), Syvanen, A. C, et al., Genomics 8:684-692 (1990), 
Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. (U.S.A.) 
88:1143-1147 (1991); Prezant, T. R. et al., Hum. Mutat. 
1:159-164 (1992); Ugozzoli, L. et al., GATA 9:107-112 

(1992) ; Nyren, P. et al., Anal. Biochem. 208:171-175 

(1993) ). These methods differ from GBA™ in that they all 
rely on the incorporation of labeled deoxynucleotides to 
discriminate between bases at a polymorphic site. In such a 
format, since the signal is proportional to the number of 
deoxynucleotides incorporated, polymorphisms that occur 
in runs of the same nucleotide can result in signals that are 
proportional to the length of the run (Syvanen, A. C, et al., 
Amer. J. Hum. Genet. 52:46-59 (1993)). 

[0122] For determining the identity of the allelic variant of 
a polymorphic region located in the coding region of a gene, 
yet other methods than those described above can be used. 
For example, identification of an allelic variant which 
encodes a mutated protein can be performed by using an 
antibody specifically recognizing the mutant protein in, e.g., 
immunohistochemistry or immunoprecipitation. Binding 
assays are known in the art and involve, e.g., obtaining cells 
from a subject, and performing binding experiments with a 
labeled lipid, to determine whether binding to the mutated 
form of the protein differs from binding to the wild-type 
protein. 

[0123] m. Molecular Structure Determination 

[0124] If a polymorphic region is located in an exon, either 
in a coding or non-coding region of the gene, the identity of 
the allelic variant can be determined by determining the 
molecular structure of the mRNA, pre-mRNA, or cDNA. 
The molecular structure can be determined using any of the 
above described methods for determining the molecular 
structure of the genomic DNA, e.g., sequencing and SSCP. 

[0125] n. Mass Spectrometric Methods 

[0126] Nucleic acids can also be analyzed by detection 
methods and protocols, particularly those that rely on mass 
spectrometry (see, e.g., U.S. Pat. No. 5,605,798, allowed 
co-pending U.S. application Ser. No. 08/617,256, allowed 
co-pending U.S. application Ser. No. 08/744,481, U.S. 
application Ser. No. 08/990,851, International PCT Appli- 
cation No. WO 98/20019). These methods can be automated 
(see, e.g., co-pending U.S. application Ser. No. 09/285,481, 
which describes an automated process line). Preferred 
among the methods of analysis herein are those involving 
the primer oligo base extension (PROBE) reaction with 
mass spectrometry for detection (described herein and else- 
where, see e.g., U.S. application Ser. Nos. 08/617,256, 


09/287,681, 09/287,682, 09/287,141 and 09/287,679, 
allowed co-pending U.S. application Ser. No. 08/744,481, 
International PCT Application No. PCT/US 97/20444, pub- 
lished as International PCT Application No. WO 98/20019, 
and based upon U.S. application Ser. Nos. 08/744,481, 
08/744,590, 08/746,036, 08/746,055, 08/786,988, 08/787, 
639, 08/933,792, 08/746,055, 08/786,988 and 08/787,639; 
see, also U.S. application Ser. No. 09/074,936, allowed U.S. 
application Ser. No. 08/787,639, and U.S. application Ser. 
Nos. 08/746,055 and 08/786,988, and published Interna- 
tional PCT Application No. WO 98/20020). 

[0127] A preferred format for performing the analyses is a 
chip based format in which the biopolymer is linked to a 
solid support, such as a silicon or silicon-coated substrate, 
preferably in the form of an array. More preferably, when 
analyses are performed using mass spectrometry, particu- 
larly MALDI, nanoliter volumes of sample are loaded on, 
such that the resulting spot is about, or smaller than, the size 
of the laser spot. It has been found that when this is achieved, 
the results from the mass spectrometric analysis are quan- 
titative. The area under the peaks in the resulting mass 
spectra are proportional to concentration (when normalized 
and corrected for background). Methods for preparing and 
using such chips are described in allowed co-pending U.S. 
application Ser. No. 08/787,639, co-pending U.S. applica- 
tion Ser. Nos. 08/786,988, 09/364,774, 09/371,150 and 
09/297,575; see, also U.S. application Ser. No. PCT/US97/ 
20195, which published as International PCT Application 
No. WO 98/20020. Chips and kits for performing these 
analyses are commercially available from SEQUENOM 
under the trademark MassARRAY™. MassARRAY™ relies 
on the fidelity of the enzymatic primer extension reactions 
combined with the miniaturized array and MALDI -TOF 
(Matrix-Assisted Laser Desorption Ionization- Time of 
Flight) mass spectrometry to deliver results rapidly. It accu- 
rately distinguishes single base changes in the size of DNA 
fragments relating to genetic variants without tags. 

[0128] Multiplex methods allow for the simultaneous 
detection of more than one polymorphic region in a particu- 
lar gene or polymorphic regions in several genes. This is the 
preferred method for carrying out haplotype analysis of 
allelic variants of the COX6B and/or GPI-1 genes sepa- 
rately, or along with allelic variants of one or more other 
genes associated with cardiovascular disease. 

[0129] Multiplexing can be achieved by several different 
methodologies. For example, several mutations can be 
simultaneously detected on one target sequence by employ- 
ing corresponding detector (probe) molecules (e.g., oligo- 
nucleotides or oligonucleotide mimetics). The molecular 
weight differences between the detector oligonucleotides 
must be large enough so that simultaneous detection (mul- 
tiplexing) is possible. This can be achieved either by the 
sequence itself (composition or length) or by the introduc- 
tion of mass-modifying functionalities into the detector 
oligonucleotides (see below). 

[0130] Mass modifying moieties can be attached, for 
instance, to either the 5'-end of the oligonucleotide, to the 
nucleobase (or bases), to the phosphate backbone, and to the 
2'-position of the nucleoside (nucleosides) and/or to the 
terminal 3' -position. Examples of mass modifying moieties 
include, for example, a halogen, an azido, or of the type, XR, 
wherein X is a linking group and R is a mass-modifying 
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functionality. The mass-modifying functionality can thus be 
used to introduce defined mass increments into the oligo- 
nucleotide molecule. 

[0131] The mass -modifying functionality can be located at 
different positions within the nucleotide moiety (see, e.g., 
U.S. Pat. No. 5,547,835 and International PCT Application 
No. WO 94/21822). For example, the mass- modifying moi- 
ety, M, can be attached either to the nucleobase, (in case of 
the c 7 -deazanucleosides also to C-7), to the triphosphate 
group at the alpha phosphate or to the 2' -position of the sugar 
ring of the nucleoside triphosphate. Modifications intro- 
duced at the phosphodiester bond, such as with alpha-thio 
nucleoside triphosphates, have the advantage that these 
modifications do not interfere with accurate Watson-Crick 
base-pairing and additionally allow for the one-step post- 
synthetic site-specific modification of the complete nucleic 
acid molecule e.g., via alkylation reactions (see, e.g., Naka- 
maye et al. (1988) Nucl. Acids Res. 16:9947-59). Particu- 
larly preferred mass-modifying functionalities are boron- 
modified nucleic acids since they are better incorporated into 
nucleic acids by polymerases (see, e.g., Porter et al. (1995) 
Biochemistry 34:11963-11969; Hasan et al. (1996) Nucleic 
Acids Res. 24:2150-2157; Li et al. (1995) Nucl. Acids Res. 
23:4495-4501). 

[0132] Furthermore, the mass-modifying functionality can 
be added so as to affect chain termination, such as by 
attaching it to the S'-position of the sugar ring in the 
nucleoside triphosphate. For those skilled in the art, it is 
clear that many combinations can be used in the methods 
provided herein. In the same way, those skilled in the art will 
recognize that chain-elongating nucleoside triphosphates 
can also be mass-modified in a similar fashion with numer- 
ous variations and combinations in functionality and attach- 
ment positions. 

[0133] For example, without being bound to any particular 
theory, the mass-modification can be introduced for X in XR 
as well as using oligo-/polyethylene glycol derivatives for R. 
The mass-modifying increment (m) in this case is 44, i.e. 
five different mass-modified species can be generated by just 
changing m from 0 to 4 thus adding mass units of 45 (m=0), 
89 (m=l), 133 (m=2), 177 (m=3) and 221 (m=4) to the 
nucleic acid molecule (e.g., detector oligonucleotide (D) or 
the nucleoside triphosphates, respectively). The oligo/poly- 
ethylene glycols can also be mono alkylated by a lower alkyl 
such as, but are not limited to, methyl, ethyl, propyl, 
isopropyl and t-butyl. Other chemistries can be used in the 
mass-modified compounds (see, e.g., those described in 
Oligonucleotides and Analogues, A Practical Approach, F. 
Eckstein, editor, IRL Press, Oxford, 1991). 

[0134] In yet another embodiment, various mass-modify- 
ing functionalities, R, other than oligo/polyethylene glycols, 
can be selected and attached via appropriate linking chem- 
istries, X. A simple mass-modification can be achieved by 
substituting H for halogens, such as F, CI, Br and/or I, or 
pseudohalogens such as CN, SCN, NCS, or by using dif- 
ferent alkyl, aryl or aralkyl moieties such as methyl, ethyl, 
propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, 
benzyl, or functional groups such as CH 2 F, CHF 2 , CF 3 , 
Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H5) 2 , Si(C 2 H 5 ) 3 . Yet 
another mass-modification can be obtained by attaching 
homo- or heteropeptides through the nucleic acid molecule 
(e.g., detector (D)) or nucleoside triphosphates). One 


example, useful in generating mass-modified species with a 
mass increment of 57, is the attachment of oligoglycines (m) 
to nucleic acid molecules (r), e.g., mass-modifications of 74 
(r=l, ni=0), 131 (r=l, m=l), 188 (r=l, m=2), 245 (r=l, m=3) 
are achieved. Simple oligoamides also can be used, e.g., 
mass-modifications of 74 (r=l, m=0), 88 (r=2, m=0), 102 
(r=3, m=0), 116(r=4, m=0), etc. are obtainable. Variations in 
additions to those set forth herein will be apparent to the 
skilled artisan. 

[0135] Different mass-modified detector oligonucleotides 
can be used to simultaneously detect all possible variants/ 
mutants simultaneously. Alternatively, all four base permu- 
tations at the site of a mutation can be detected by designing 
and positioning a detector oligonucleotide, so that it serves 
as a primer for a DNA/RNA polymerase with varying 
combinations of elongating and terminating nucleoside 
triphosphates. For example, mass modifications also can be 
incorporated during the amplification process. 

[0136] A different multiplex detection format is one in 
which differentiation is accomplished by employing differ- 
ent specific capture sequences which are position-specifi- 
cally immobilized on a flat surface (e.g., a 'chip array"). If 
different target sequences Tl-Tn are present, their target 
capture sites TCSl-TCSn will specifically interact with 
complementary immobilized capture sequences Cl-Cn. 
Detection is achieved by employing appropriately mass 
differentiated detector oligonucleotides Dl -Dn, which are 
mass modifying functionalities Ml-Mn. 

[0137] o. Other Methods p Additional methods of analyz- 
ing nucleic acids include amplification-based methods 
including polymerase chain reaction (PCR), ligase chain 
reaction (LCR), mini-PCR, rolling circle amplification, 
autocatalytic methods, such as those using OJ replicase, 
TAS, 3SR, and any other suitable method known to those of 
skill in the art. 

[0138] Other methods for analysis and identification and 
detection of polymorphisms, include but are not limited to, 
allele specific probes, Southern analyses, and other such 
analyses. 

[0139] 2. Primers and Probes 

[0140] Primers refer to nucleic acids which are capable of 
specifically hybridizing to a nucleic acid sequence which is 
adjacent to a polymorphic region of interest or to a poly- 
morphic region and are extended. A primer can be used 
alone in a detection method, or a primer can be used together 
with at least one other primer or probe in a detection method. 
Primers can also be used to amplify at least a portion of a 
nucleic acid. For amplifying at least a portion of a nucleic 
acid, a forward primer (i.e., 5' primer) and a reverse primer 
(i.e., 3' primer) will preferably be used. Forward and reverse 
primers hybridize to complementary stands of a double 
stranded nucleic acid, such that upon extension from each 
primer, a double stranded nucleic acid is amplified. 

[0141] Probes refer to nucleic acids which hybridize to the 
region of interest and which are not further extended. For 
example, a probe is a nucleic acid which hybridizes adjacent 
to or at a polymorphic region of a COX6B gene, a GPI-1 
gene or another gene associated with cardiovascular disease 
and which by hybridization or absence of hybridization to 
the DNAof a subject will be indicative of the identity of the 
allelic variant of the polymorphic region of the gene. Pre- 
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ferred probes have a number of nucleotides sufficient to 
allow specific hybridization to the target nucleotide 
sequence. Where the target nucleotide sequence is present in 
a large fragment of DNA, such as a genomic DNA fragment 
of several tens or hundreds of kilobases, the size of a probe 
may have to be longer to provide sufficiently specific hybrid- 
ization, as compared to a probe which is used to detect a 
target sequence which is present in a shorter fragment of 
DNA. For example, in some diagnostic methods, a portion 
of a COX6B gene, a GPI-1 gene or another gene associated 
with cardiovascular disease may first be amplified and thus 
isolated from the rest of the chromosomal DNA and then 
hybridized to a probe. In such a situation, a shorter probe 
will likely provide sufficient specificity of hybridization. For 
example, a probe having a nucleotide sequence of about 10 
nucleotides may be sufficient. 

[0142] Preferred primers and probes hybridize adjacent to 
or at the polymorphic sites described in TABLES 1-3. In 
addition, preferred primers include SEQ ID NOS.: 5, 10, 43, 
48, 53, 58, 63, 68, 73, 78, 83, 88, 93, 98, 103, 108, 113, and 
118. 

[0143] Primers and probes (RNA, DNA (single -stranded 
or double-stranded), PNA and their analogs) described 
herein may be labeled with any detectable reporter or signal 
moiety including, but not limited to radioisotopes, enzymes, 
antigens, antibodies, spectrophotometric reagents, chemilu- 
minescent reagents, fluorescent and any other light produc- 
ing chemicals. Additionally, these probes may be modified 
without changing the substance of their purpose by terminal 
addition of nucleotides designed to incorporate restriction 
sites or other useful sequences, proteins, signal generating 
ligands such as acridinium esters, and/or paramagnetic par- 
ticles. 

[0144] These probes may also be modified by the addition 
of a capture moiety (including, but not limited to para- 
magnetic particles, biotin, fluorescein, dioxigenin, antigens, 
antibodies) or attached to the walls of microtiter trays to 
assist in the solid phase capture and purification of these 
probes and any DNA or RNA hybridized to these probes. 
Fluorescein may be used as a signal moiety as well as a 
capture moiety, the latter by interacting with an anti-fluo- 
rescein antibody. 

[0145] Any probe or primer can be prepared according to 
methods well known in the art and described, e.g., in 
Sambrook, J. Fritsch, E. F., and Maniatis, T. (1989( Molecu- 
lar Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N Y. For example, 
discrete fragments of the DNA can be prepared and cloned 
using restriction enzymes. Alternatively, probes and primers 
can be prepared using the Polymerase Chain Reaction (PCR) 
using primers having an appropriate sequence. 

[0146] Oligonucleotides may be synthesized by standard 
methods known in the art, e.g. by use of an automated DNA 
synthesizer (such as are commercially available from Bio- 
search (Novato, Calif.); Applied Biosystems (Foster City, 
Calif.), etc.). As examples, phosphorothioate oligonucle- 
otides may be synthesized by the method of Stein et al. 
(1988, Nucl. Acids Res. 16:3209), methylphosphonate oli- 
gonucleotides can be prepared by use of controlled pore 
glass polymer supports (Sarin et al., 1988, Proc. Natl. Acad. 
Sci. U.S.A. 85:7448-7451), etc. 


[0147] H. Transgenic Animals 

[0148] Methods for making transgenic animals using a 
variety of transgenes have been described in Wagner et al. 

(1981) Proc. Nat. Acad. Sc. U.S.A. 78: 5016; Stewart et al. 

(1982) Science 217: 1046; Constantini et al. (1981) Nature 
294: 92; Lacy et al (1983) Cell 34: 343; McKnight et al. 

(1983) Cell 34: 335; Brinstar et al. (1983) Nature 306: 332; 
Palmiter et al. (1982) Nature 300: 611; Palmiter et al. (1982) 
Cell 29: 701; and Palmiter et al. (1983) Science 222: 809. 
Such methods are described in U.S. Pat. Nos. 6,175,057; 
6,180,849 and 6,133,502. 

[0149] The term "transgene" is used herein to describe 
genetic material that has been or is about to be artificially 
inserted into the genome of a mammalian cell, particularly 
a mammalian cell of a living animal. The transgene is used 
to transform a cell, meaning that a permanent or transient 
genetic change, preferably a permanent genetic change, is 
induced in a cell following incorporation of exogenous 
DNA. A permanent genetic change is generally achieved by 
introduction of the DNA into the genome of the cell. Vectors 
for stable integration include, but are not limited to, plas- 
mids, retroviruses and other animal viruses and YACS. Of 
interest are transgenic mammals, including, but are not 
limited to, cows, pigs, goats, horses and others, and particu- 
larly rodents, including rats and mice. Preferably, the trans- 
genic-animals are mice. 

[0150] Transgenic animals contain an exogenous nucleic 
acid sequence present as an extrachromosomal element or 
stably integrated in all or a portion of its cells, especially 
germ cells. Unless otherwise indicated, it will be assumed 
that a transgenic animal comprises stable changes to the 
germline sequence. During the initial construction of the 
animal, "chimeras" or "chimeric animals" are generated, in 
which only a subset of cells have the altered genome. 
Chimeras are primarily used for breeding purposes in order 
to generate the desired transgenic animal. Animals having a 
heterozygous alteration are generated by breeding of chi- 
meras. Male and female heterozygotes are typically bred to 
generate homozygous animals. 

[0151] The exogenous gene is usually either from a dif- 
ferent species than the animal host, or is otherwise altered in 
its coding or non-coding sequence. The introduced gene may 
be a wild- type gene, naturally occurring polymorphism (e.g., 
as described for COX6B, GPI-1 and other genes associated 
with cardiovascular disease) or a genetically manipulated 
sequence, for example having deletions, substitutions or 
insertions in the coding or non-coding regions. When the 
introduced gene is a coding sequence, it is usually operably 
linked to a promoter, which may be constitutive or inducible, 
and other regulatory sequences required for expression in 
the host animal. 

[0152] Transgenic animals can comprise other genetic 
alterations in addition to the presence of alleles of COX6B 
and/or GPI-1 genes. For example, the genome can be altered 
to affect the function of the endogenous genes, contain 
marker genes, or contain other genetic alterations (e.g., 
alleles of other genes associated with cardiovascular dis- 
ease). 

[0153] A "knock-out" of a gene means an alteration in the 
sequence of the gene that results in a decrease of function of 
the target gene, preferably such that target gene expression 
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is undetectable or insignificant. A knock-out of an endog- 
enous COX6B or GPI-1 gene means that function of the 
gene has been substantially decreased so that expression is 
not detectable or only present at insignificant levels. 
"Knock-out" transgenics can be transgenic animals having a 
heterozygous knock-out of the COX6B or GPI-1 gene or a 
homozygous knock-out of one or both of these genes. 
"Knock-outs" also include conditional knock-outs, where 
alteration of the target gene can occur upon, for example, 
exposure of the animal to a substance that promotes target 
gene alteration, introduction of an enzyme hat promotes 
recombination at the target gene site (e.g., Cre in the Cre-lox 
system), or other method for directing the target gene 
alteration postnatally. 

[0154] A "knock-in" of a target gene means an alteration 
in a host cell genome that results in altered expression (e.g., 
increased (including ectopic)) of the target gene, e.g., by 
introduction of an additional copy of the target gene, or by 
operatively inserting a regulatory sequence that provides for 
enhanced expression of an endogenous copy of the target 
gene. "Knock -in" transgenics of interest can be transgenic 
animals having a knock-in of the COX6B or GPI-1. Such 
transgenics can be heterozygous or homozygous for the 
knock-in gene. "Knock-ins" also encompass conditional 
knock -ins. 

[0155] A construct is suitable for use in the generation of 
transgenic animals if it allows the desired level of expression 
of a COX6B or GPI-1 encoding sequence or the encoding 
sequence of another gene associated with cardiovascular 
disease. Methods of isolating and cloning a desired 
sequence, as well as suitable constructs for expression of a 
selected sequence in a host animal, are well known in the art 
and are described below. 

[0156] For the introduction of a gene into the subject 
animal, it is generally advantageous to use the gene as a gene 
construct wherein the gene is ligated downstream of a 
promoter capable of and operably linked to expressing the 
gene in the subject animal cells. Specifically, a transgenic 
non-human mammal showing high expression of the desired 
gene can be created by microinjecting a vector ligated with 
said gene into a fertilized egg of the subject non-human 
mammal (e.g., rat fertilized egg) downstream of various 
promoters capable of expressing the protein and/or the 
corresponding protein derived from various mammals (rab- 
bits, dogs, cats, guinea pigs, hamsters, rats, mice etc., 
preferably rats etc.) Useful vectors include Escherichia 
co/i-derived plasmids, Bacillus subtilis-deiived plasmids, 
yeast-derived plasmids, bacteriophages such as lambda, 
phage, retroviruses such as Moloney leukemia virus, and 
animal viruses such as vaccinia virus or baculovirus. 

[0157] Useful promoters for such gene expression regu- 
lation include, for example, promoters for genes derived 
from viruses (cytomegalovirus, Moloney leukemia virus, JC 
virus, breast cancer virus etc.), and promoters for genes 
derived from various mammals (humans, rabbits, dogs, cats, 
guinea pigs, hamsters, rats, mice etc.) and birds (chickens 
etc.) (e.g., genes for albumin, insulin II, erythropoietin, 
endothelin, osteocalcin, muscular creatine kinase, platelet- 
derived growth factor beta, keratins Kl, K10 and K14, 
collagen types I and II, atrial natriuretic factor, dopamine 
beta-hydroxylase, endothelial receptor tyrosine kinase (gen- 
erally abbreviated Tie2), sodium-potassium adenosine triph- 


osphorylase (generally abbreviated Na,K-ATPase), neu- 
rofilament light chain, metallothioneins I and IIA, 
metalloproteinase I tissue inhibitor, MHC class I antigen 
(generally abbreviated H-2L), smooth muscle alpha actin, 
polypeptide chain elongation factor 1 alpha (EF-1 alpha), 
beta actin, alpha and beta myosin heavy chains, myosin light 
chains 1 and 2, myelin base protein, serum amyloid com- 
ponent, myoglobin, renin etc.). 

[0158] It is preferable that the above-mentioned vectors 
have a sequence for terminating the transcription of the 
desired messenger RNA in the transgenic animal (generally 
referred to as terminator); for example, gene expression can 
be manipulated using a sequence with such function con- 
tained in various genes derived from viruses, mammals and 
birds. Preferably, the simian virus SV40 terminator etc. are 
commonly used. Additionally, for the purpose of increasing 
the expression of the desired gene, the splicing signal and 
enhancer region of each gene, a portion of the intron of a 
eukaryotic organism gene may be ligated 5' upstream of the 
promoter region, or between the promoter region and the 
translational region, or 3' downstream of the translational 
region as desired. 

[0159] A translational region for a protein of interest can 
be obtained using the entire or portion of genomic DNA of 
blood, kidney or fibroblast origin from various mammals 
(humans, rabbits, dogs, cats, guinea pigs, hamsters, rats, 
mice etc.) or of various commercially available genomic 
DNA libraries, as a starting material, or using complemen- 
tary DNA prepared by a known method from RNA of blood, 
kidney or fibroblast origin as a starting material. Also, an 
exogenous gene can be obtained using complementary DNA 
prepared by a known method from RNA of human fibroblast 
origin as a starting material. All these translational regions 
can be utilized in transgenic animals. 

[0160] To obtain the translational region, it is possible to 
prepare DNA incorporating an exogenous gene encoding the 
protein of interest in which the gene is ligated downstream 
of the above-mentioned promoter (preferably upstream of 
the translation termination site) as a gene construct capable 
of being expressed in the transgenic animal. 

[0161] DNA constructs for random integration need not 
include regions of homology to mediate recombination. 
Where homologous recombination is desired, the DNA 
constructs will comprise at least a portion of the target gene 
with the desired genetic modification, and will include 
regions of homology to the target locus. Conveniendy, 
markers for positive and negative selection are included. 
Methods for generating cells having targeted gene modifi- 
cations through homologous recombination are known in 
the art. For various techniques for transfecting mammalian 
cells, see Keown et al. (1990) Methods in Enzymology 
185:527-537. 

[0162] The transgenic animal can be created by introduc- 
ing a COX6B or GPI-1 gene construct into, for example, an 
unfertilized egg, a fertilized egg, a spermatozoon or a 
germinal cell containing a primordial germinal cell thereof, 
preferably in the embryogenic stage in the development of 
a non-human mammal (more preferably in the single-cell or 
fertilized cell stage and generally before the S-cell phase), by 
standard means, such as the calcium phosphate method, the 
electric pulse method, the lipofection method, the aggluti- 
nation method, the microinjection method, the particle gun 
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method, the DEAE-dextran method and other such method. 
Also, it is possible to introduce a desired COX6B or GPI-1 
gene into a somatic cell, a living organ, a tissue cell, or the 
like, by gene transformation methods, and utilize it for cell 
culture, tissue culture etc. Furthermore, these cells may be 
fused with the above-described germinal cell by a com- 
monly known cell fusion method to create a transgenic 
animal. 

[0163] For embryonic stem (ES) cells, an ES cell line may 
be employed, or embryonic cells may be obtained freshly 
from a host, e.g. mouse, rat, guinea pig, etc. Such cells are 
grown on an appropriate fibroblast-feeder layer or grown in 
the presence of appropriate growth factors, such as leukemia 
inhibiting factor (LIF). When ES cells have been trans- 
formed, they may be used to produce transgenic animals. 
After transformation, the cells are plated onto a feeder layer 
in an appropriate medium. Cells containing the construct 
may be detected by employing a selective medium. After 
sufficient time for colonies to grow, they are picked and 
analyzed for the occurrence of homologous recombination 
or integration of the construct. Those colonies that are 
positive may then be used for embryo manipulation and 
blastocyst injection. Blastocysts are obtained from 4 to 6 
week old superovulated females. The ES cells are 
trypsinized, and the modified cells are injected into the 
blastocoel of the blastocyst. After injection, the blastocysts 
are returned to each uterine horn of pseudopregnant females. 
Females are then allowed to go to term and the resulting 
litters screened for mutant cells having the construct. By 
providing for a different phenotype of the blastocyst and the 
ES cells, chimeric progeny can be readily detected. The 
chimeric animals are screened for the presence of the 
modified gene and males and females having the modifica- 
tion are mated to produce homozygous progeny. If the gene 
alterations cause lethality at some point in development, 
tissues or organs can be maintained as allogeneic or con- 
genic grafts or transplants, or in in vitro culture. 

[0164] Animals containing more than one transgene, such 
as allelic variants of COX6B and/or GPI-1 and/or other 
genes associated with cardiovascular disease can be made by 
sequentially introducing individual alleles into an animal in 
order to produce the desired phenotype (manifestation or 
predisposition to cardiovascular disease). 

[0165] I. Effect of Allelic Variants on the Encoded Protein 
and Disease Related Phenotype 

[0166] The effect of an allelic variant on a COX6B or 
GPI-1 protein (altered amount, stability, location and/or 
activity) can be determined according to methods known in 
the art. Alielic variants of the COX6B and GPI-1 genes can 
be assayed individually or in combination with other vari- 
ants known to be associated with cardiovascular disease. 

[0167] If the mutation is located in an intron, the effect of 
the mutation can be determined, e.g., by producing trans- 
genic animals in which the allelic variant linked to lipid 
metabolism and/or cardiovascular disease has been intro- 
duced and in which the wild-type gene or predominant allele 
may have been knocked out. Comparison of the level of 
expression of the protein in the mice transgenic for the 
allelic variant with mice transgenic for the predominant 
allele will reveal whether the mutation results in increased or 
decreased synthesis of the associated protein and/or aberrant 
tissue distribution of the associated protein. Such analysis 


could also be performed in cultured cells, in which the 
human variant allele gene is introduced and, e.g., replaces 
the endogenous gene in the cell. Thus, depending on the 
effect of the alteration a specific treatment can be adminis- 
tered to a subject having such a mutation. Accordingly, if the 
mutation results in decreased production of a COX6B or 
GPI-1 protein, the subject can be treated by administration 
of a compound which increases synthesis, such as by 
increasing COX6B or GPI-1 gene expression, and wherein 
the compound acts at a regulatory element different from the 
one which is mutated. Alternatively, if the mutation results 
in increased COX6B or GPI-1 protein levels, the subject can 
be treated by administration of a compound which reduces 
protein production, e.g., by reducing COX6B or GPI-1 gene 
expression or a compound which inhibits or reduces the 
activity of COX6B or GPI-1 protein. 

[0168] J. Diagnostic and Prognostic Assays 

[0169] Typically, an individual allelic variant that associ- 
ates with a risk factor for cardiovascular disease will not be 
used in isolation as a prognosticator for a subject developing 
high cholesterol, low HDL or cardiovascular disease. An 
allelic variant typically will be one of a plurality of indica- 
tors that are utilized. The other indicators may be the 
manifestation of other risk factors for cardiovascular dis- 
ease, e.g., family history, high blood pressure, weight, 
activity level, etc., or additional allelic variants in the same 
or other genes associated with altered lipid metabolism 
and/or cardiovascular disease. 

[0170] Useful combinations of allelic variants of the 
COX6B gene and/or the GPI-1 gene can be determined by 
examining combinations of variants of these genes, which 
are assayed individually or assayed simultaneously using 
multiplexing methods as described above or any other 
labelling method that allows different variants to be identi- 
fied. In particular, variants of COX6B gene and/or the GPI-1 
gene may be assayed using kits (see below) or any of a 
variety micro arrays known to those in the art. For example, 
oligonucleotide probes comprising the polymorphic regions 
surrounding any polymorphism in the COX6B or GPI-1 
gene may be designed and fabricated using methods such as 
those described in U.S. Pat. Nos. 5,492,806; 5,525,464; 
5,695,940; 6,018,041; 6,025,136; WO 98/30883; WO 
98/56954; WO99/09218; WO 00/58516; WO 00/58519, or 
references cited therein. Similarly one of skill in the art can 
determine useful combinations of allelic variants of the 
COX6B and/or GPI-1 genes along with variants of other 
genes associated with cardiovascular disease. 

[0171] K. Pharmacogenomics 

[0172] It is likely that subjects having one or more differ- 
ent allelic variants of the COX6B or GPI-1 polymorphic 
regions will respond differently to therapeutic drugs to treat 
cardiovascular disease or conditions. For example, there are 
numerous drugs available for lowering cholesterol levels: 
including lovastatin (MEVACOR; Merck & Co.), simvas- 
tatin (XOCOR; Merck & Co.), dextrothyroxine 
(CHOLOXIN; Knoll Pharmaceutical Co.), pamaqueside 
(Pfizer), cholestryr amine (QUESTRAN; Bristol-Myers 
Squibb), colestipol (COLESTID; Pharmacia & Upjohn), 
acipomox (Pharmacia & Upjohn), fenofibrate (LIPIDIL), 
gemfibrozil (LOPID; Warner-Lambert), cerivastatin 
(LIPOBAY; Bayer), fluvastatin (LESCOL; Novartis), ator- 
vastatin (LIPITOR, Warner-Lambert), etofylline clofibrate 
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(DUOLIP; Merckle (Germany)), probucol (LORELCO; 
Hoechst Marion Roussel), omacor (Pronova (Norway), 
etofibrate (Merz (Germany), clofibrate (ATROMID-S; 
Wyeth-Ayerst (AHP)), and niacin (numerous manufactur- 
ers). All patients do not respond identically to these drugs. 
Alleles of the COX6B or the GPI-1 gene which associate 
with altered lipid metabolism will be useful alone or in 
conjunction with markers in other genes associated with the 
development of cardiovascular disease to predict a subject's 
response to a therapeutic drug. For example, multiplex 
primer extension assays or micro arrays comprising probes 
for alleles are useful formats for determining drug response. 
A correlation between drug responses and specific alleles or 
combinations of alleles of the COX6B or GPI-1 genes and 
other genes associated with cardiovascular disease can be 
shown, for example, by clinical studies wherein the response 
to specific drugs of subjects having different allelic variants 
of polymorphic regions of the COX6B or GPI-1 genes alone 
or in combination with allelic variants of other genes are 
compared. Such studies can also be performed using animal 
models, such as mice having various alleles and in which, 
e.g., the endogenous COX6B or GPI-1 genes have been 
inactivated such as by a knock-out mutation. Test drugs are 
then administered to the mice having different alleles and the 
response of the different mice to a specific compound is 
compared. Accordingly, assays, microarrays and kits are 
provided for determining the drug which will be best suited 
for treating a specific disease or condition in a subject based 
on the individual's genotype. For example, it will be pos- 
sible to select drugs which will be devoid of toxicity, or have 
the lowest level of toxicity possible for treating a subject 
having a disease or condition, e.g., cardiovascular disease or 
high cholesterol or low HDL. 

[0173] L. Kits 

[0174] Kits can be used to indicate whether a subject is at 
risk of developing high cholesterol, low HDL and/or car- 
diovascular disease. The kits can also be used to determine 
if a subject who has high cholesterol or low HDL carries 
associated variants in the COX6B or GPI-1 genes or other 
cardiovascular disease -related genes. This information could 
be used, e.g., to optimize treatment of such individuals as a 
particular genotype may be associated with drug response. 

[0175] In preferred embodiments, the kits comprise a 
probe or primer which is capable of hybridizing adjacent to 
or at a polymorphic region of a OX6B or GPI-1 gene and 
thereby identifying whether the COX6B or GPI-1 gene 
contains an allelic variant which is associated with cardio- 
vascular disease. Primers or probes that specifically hybrid- 
ize at or adjacent to the SNPs described in Tables 1-3 could 
be included. In particular, primers or probes which comprise 
the sequences of SEQ ID NOs.: 5, 10, 43, 48, 53, 58, 63, 68, 
73, 78, 83, 88, 93, 98, 103, 108, 113, and 118 could be 
included in the kits. The kits preferably further comprise 
instructions for use in carrying out assays, interpreting 
results and diagnosing a subject as having a predisposition 
toward developing high cholesterol, low HDL and/or car- 
diovascular disease. 

[0176] Preferred kits for amplifying a region of a COX6B 
gene, GPI-1 gene, or other genes associated with cardiovas- 
cular disease (such as those listed in Table 3) comprise two 
primers which flank a polymorphic region of the gene of 
interest. For example primers can comprise the sequences of 


SEQ ID NOs.: 3, 4, 8, 9, 41, 42, 46, 47, 51, 52, 56, 57, 61, 
62, 66, 67, 71, 72, 76, 77, 81, 82, 86, 87, 91, 92, 96, 97, 101, 
102, 106, 107, 111, 112, 116, and 117. For other assays, 
primers or probes hybridize to a polymorphic region or 5' or 
3' to a polymorphic region depending on which strand of the 
target nucleic acid is used. For example, specific probes and 
primers comprise sequences designated as SEQ ID NOs: 5, 
10, 43, 48, 53, 58, 63, 68, 73, 78, 83, 88, 93, 98, 103, 108, 
113, and 118. Those of skill in the art can synthesize primers 
and probes which hybridize adjacent to or at the polymor- 
phic regions described in TABLES 1-3 and other SNPs in 
genes associated with cardiovascular disease. 

[0177] Yet other kits comprise at least one reagent neces- 
sary to perform an assay. For example, the kit can comprise 
an enzyme, such as a nucleic acid polymerase. Alternatively 
the kit can comprise a buffer or any other necessary reagent. 

[0178] Yet other kits comprise microarrays of probes to 
detect allelic variants of COX6B, GPI-1, and other genes 
associated with cardiovascular disease. The kits further 
comprise instructions for their use and interpreting the 
results. 

[0179] The following examples are included for illustra- 
tive purposes only and are not intended to limit the scope of 
the invention. The practice of methods and development of 
the products provided herein employ, unless otherwise indi- 
cated, conventional techniques of cell biology, cell culture, 
molecular biology, transgenic biology, microbiology, recom- 
binant DNA, and immunology, which are within the skill of 
the art. Such techniques are explained fully in the literature. 
See, for example, Molecular Cloning A Laboratory Manual, 
2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold 
Spring Harbor Laboratory Press: 1989); DNA Cloning, 
Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide 
Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Pat. No. 
4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. 
J. Higgins eds. 1984); Transcription and Translation (B. D. 
Hames & S. J. Higgins eds. 1984); Culture of Animal CeUs 
(R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells 
and Enzymes (IRL Press, 1986); B. Perbal, A Practical 
Guide To Molecular Cloning (1984); the treatise, Methods 
In Enzymology (Academic Press, Inc., New York); Gene 
Transfer Vectors For Mammalian Cells (J. H. Miller and M. 
P. Calos eds., 1987, Cold Spring Harbor Laboratory); Meth- 
ods In Enzymology, Vols. 154 and 155 (Wu et al. eds., 
Immunochemical Methods In Cell and Molecular Biology 
(Mayer and Walker, eds., Academic Press, London, 1987); 
Handbook of Experimental Immunology, Volumes I-IV (D. 
M. Weir and C. C. BlackweU, eds., 1986); Manipulating the 
Mouse Embryo, (Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., 1986). 

EXAMPLE 1 

[0180] Isolation of DNA from Blood Samples of a Strati- 
fied Population 

[0181] Blood samples were obtained from a population of 
unrelated Caucasian women between the ages of 18-79 
(average age=48). The women had, no response to media 
campaigns, attended the Twin Research Unit at the St. 
Thomas Hospital in London, England. For current purposes, 
only one member of a twin pair was used to insure that all 
observations were independent. Blood samples from 1400 
unrelated individuals were measured for levels of choles- 
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terol and HDL. Cholesterol and HDL level in blood samples 
were quantitated using standard assay methods. 

[0182] The population was stratified into pools of 200 
people, which represented the lower extreme and the upper 
extreme for serum levels of cholesterol and HDL. 


Cholesterol 

Pool 1: Individuals were considered to have low 

cholesterol (0.12-3.6 mmoles/L). 

Pool 2: Individuals were considered to have high. 

cholesterol (5.25-11.57 mmoles/L). 
HDL 

Pool 3: Individuals were considered to have low levels 

af HDL (0.240-1.11 mmoles/L) 
Pool 4: Individuals were considered to have high, levels 

of HDL (2.10-3.76 mmoles/L). 


[0183] DNA Extraction Protocol 

[0184] DNA was extracted from blood samples of each of 
the pools by utilizing the following protocol. 

[0185] Section 1 

[0186] 1. Blood was extracted into EDTA tubes. 

[0187] 2. Blood sample was spun at 3,000 rpm for 10 
minutes in a clinical centrifuge. 

[0188] 3. The buffy coat (the leukocytes, a yellowish 
layer of cells on top of the red blood cells) was removed 
and pooled into a 1 ml conical tube. 

[0189] 4. 0.9% saline was added to fill the tube and 
resuspend the leucocytes. Sample were immediately 
further processed or stored at 4° C. for 24 hrs. 

[0190] 5. The sample was spun at 2,500 rpm for 10 
minutes. 

[0191] 6. The buffy coat was again removed as cleanly 
as possible leaving behind any red cells, the sample was 
suspended in red cell lysis buffer and left for 20 minutes 
at 4° C. 

[0192] 7. The sample was spun again at 2,500 rpm for 
10 minutes. If a pellet of unlysed red cells remained 
lying above the leucocytes the treatment with red cell 
lysis buffer was repeated. 

[0193] 8. The leucocyte pellet was resuspended in 2 ml 
0.9% saline. 

[0194] 9. The DNA was liberated by the addition of 
leucocyte lysis buffer — the tube was capped and gently 
inverted several times, until the liquid became viscous 
with DNA. The samples were handled with care to 
avoid shearing and damage to the DNA. 

[0195] 10. Samples were frozen for storage prior to full 
extraction. 

[0196] Section 2 

[0197] 11. 2 ml of 5 M sodium perchlorate was added 
to the thawed sample and mixed by inversion. The 
sample was heated to 60° C. for 30-40 minutes to fully 
denature proteins. 


[0198] 12. An equal volume of chloroform/isoamyl 
alcohol (24:1) was added at room temperature and the 
sample mixed for 10 minutes. 

[0199] 13. The sample was spun without a break at 
3,000 rpm for 10 minutes. 

[0200] 14. The top aqueous phase was removed into a 
clean tube and two volumes of cold 100% ethanol 
added and mixed by inversion to precipitate DNA. 

[0201] 15. The DNA was removed using a sterile loop 
and resuspended in 1-5 ml TE buffer depending on the 
DNA yield. 

[0202] 16. The optical density was measured at 260 and 
280 nm to check yield and purity of the DNA sample. 
For use in Examples 2 and 3, all DNA had an absor- 
bance ratio of 1.6 at 260/280, a total yield of 32 /ig and 
a concentration of 10 ng//il. If initial purity levels were 
unacceptable a re-extraction was carried out (sections 
12-15 above). 

EXAMPLE 2 

[0203] Detection of an Association Between an SNP at 
Position 86 of the Human COX6B Gene and High Choles- 
terol 

[0204] DNA samples (as prepared in Example 1), repre- 
senting 200 women, from the lower extreme, pool 1 (low 
levels of cholesterol) and the upper extreme, pool 2 (high 
levels of cholesterol) were amplified and analyzed for 
genetic differences using a MassEXTEND™ assay detection 
method. For each pool, single nucleotide polymorphisms 
were examined throughout the entire genome to detect 
differences in allelic frequency of a variant allele between 
the pools. 

[0205] PCR Amplification of Samples from Pools 1 and 2 

[0206] PCR primers were synthesized by Operon 
(Alameda, Calif.) using phosphoramidite chemistry. Ampli- 
fication of the COX6B target sequence was carried out in 
two 50 /il PCR reactions with 100 ng of pooled human 
genomic DNA, obtained as described in Example 1, taken 
from samples in pool 1 or pool 2, although amounts ranging 
from 100 ng to 1 ug could be used. Individual DNA 
concentrations within the pooled samples were present in 
equal concentration with a final concentration of 0.5 ng. 
Each reaction contained lxPCR buffer (Qiagen, Valencia, 
Calif.), 200 //M dNTPs, 1U Hotstar Taq polymerase 
(Qiagen, Valencia, Calif.), 4 mM MgCl 2 , and 25 pmols of 
the long primer containing both the universal primer 
sequence and the target specific sequence 5 -AGCGGATAA- 
CAATTTCACACAGGTAGTCTGGTTCTGGTTGGGG-3 1 
(SEQ ID NO.: 4), 2 pmoles of the short primer S'-AGGAT- 
TCAGCACCATGGC-3' (SEQ ID NO.: 3) and 10 pmoles of 
a biotinylated universal primer complementary to the 5' end 
of the PCR amplicon 5 T -AGCGGATAACAATTTCACA- 
CAGG-3' (SEQ ID NO.: 121). Alternatively, the biotinylated 
universal primer could be S'-GGCGCACGCCTCCACG-S 1 
(SEQ ID NO.: 122). After an initial round of amplification 
with the target with the specific forward (long) and reverse 
primer (short), the 5' biotinylated universal primer then 
hybridized and acted as a reverse primer thereby introducing 
a 3' biotin capture moiety into the molecule. The amplifi- 
cation protocol results in a 5'-biotinylated double stranded 
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DNA amplicon and dramatically reduces the cost of high 
throughput genotyping by eliminating the need to 5' biotin 
label each forward primer used in a genotyping. Thermal 
cycling was performed in 0.2 mL tubes or 96 well plate 
using an MJ Research Thermal Cycler (Waltham, Mass.) 
(calculated temperature) with the following cycling param- 
eters: 94° C. for 5 min; 45 cycles: 94° C. for 20 sec, 56° C. 
for 30 sec, 72° C. for 60 sec; 72° C. 3 min. 

[0207] Immobilization of DNA 

[0208] The 50 fA PCR reaction was added to 25 fA of 
streptavidin coated magnetic bead (Dynal, Lake Success, 
N.Y.) prewashed three times and resuspended in 1 M NH 4 C1, 
0.06 M NH 4 OH. The PCR amplicons were allowed to bind 
to the beads for 15 minutes at room temperature. The beads 
were then collected with a magnet and the supernatant 
containing unbound DNA was removed. The unbound 
strand was released from the double stranded amplicons by 
incubation in 100 mM NaOH and washing of the beads three 
times with 10 mM Tris pH 8.0. 

[0209] Genotyping 

[0210] The frequency of the alleles at position 86 in the 
COX6B gene was measured using the MassEXTEND™ 
assay and MALDI-TOF. The SNP identified at position 86 of 
COX6B in the GenBank sequence is represented as a C to 
T transversion. The MassEXTEND™ assay used detected 
the sequence of the complementary strand, thus the SNP was 
represented as G to A in the primer extension products. The 
DNA coated magnetic beads were resuspended in 26 mM 
Tris-HCLpH 9.5, 6.5 mM MgCl 2 and 50 mM each of dTTPs 
and 50 mM each of ddCTP, ddATP, ddGTP, 2.5 U of a 
thermostable DNA polymerase (Amersham Pharmacia Bio- 
tech, Piscataway, N.J.) and 20 pmoles of a template specific 
oligonucleotide primer 5'-AATCAAGAACTACAAGAC-3' 
(SEQ ID NO.: 5) (Operon, Alameda, Calif.). Primer exten- 
sion occurred with three cycles of oligonucleotide primer 
hybridization and extension. The extension products were 
analyzed after denaturation from the template with 50 mM 
NH 4 C1 and transfer of 150 nl of each sample to a silicon chip 
preloaded with 150 nl of H3PA (3-hydroxy picolinic acid) 
(Sigma Aldrich, St Louis, Mo.) matrix material. The sample 
material was allowed to crystallize and analyzed by 
MALDI-TOF (Bruker Daltonics, Billerica, Mass.; PerSep- 
tive, Foster City, Calif.). The mass of the primer used in the 
MassEXTEND™ reaction was 5493.70 daltons. The pre- 
dominant allele is extended by the addition of ddC, which 
has a mass of 5766.90 daltons. The allelic variant results in 
the addition of dT and ddG to the primer to produce an 
extension product having a mass of 6111.10 daltons. 

[0211] In addition to being analyzed as part of a pool, each 
individual sample (0.5 ng) was amplified as described above 
and analyzed individually using a MassEXTEND™ reaction 
as described above. 

[0212] Pooled populations of women (200 women per 
pool) with high cholesterol (pool 2) showed an increase in 
the frequency of the A allele at nucleotide position 86 of 
COX6B as compared with those with low levels of choles- 
terol (pool 1) (see FIG. 1). The association of this allelic 
variant of the COX6B gene with high cholesterol gave a 
statistically significant value of 14.30 using a 1-degree-of- 
freedom chi-squared test of association. In other words, the 
increase of 2.75% to 9.05% is significant, with a p value of 


0.000156 (see FIG. 1). The genotype of each of the indi- 
viduals in the pooled population was also determined by 
carrying out MassEXTEND™ reactions on each DNA 
samples individually. These analysis confirmed the pooling 
data showing that there was an increase in the frequency of 
the A allele of 2.27% to 9.93%, (p=0.0000061). The geno- 
types in pool 2 showed a decrease in the homozygous GG 
genotype from 95.4% to 82.35% and an increase in the 
heterozygous GA genotype from 4.55% to 15.44%. None of 
the individuals with low levels of serum cholesterol exhib- 
ited the homozygous AA genotype. 

EXAMPLE 3 

[0213] Detection of an Association Between an SNP at 
Position 2577 of the Human GPI-1 Gene and Low HDL 

[0214] DNA samples (as prepared in Example 1), repre- 
senting 200 women, from pool 3 (low level of HDL) and 
pool 4 (high levels of HDL) were amplified and analyzed for 
genetic differences using a MassEXTEND™ detection 
method. For each pool, SNPs were examined throughout the 
genome to detect differences in allelic frequency of variant 
alleles between the pools. 

[0215] PCR Amplification of Samples from Pools 3 and 4 

[0216] PCR primers were synthesized by Operon 
(Alameda, Calif.) using phosphoramidite chemistry. Ampli- 
fication of the GPI-1 target sequence was carried out in 
single 50 /A PCR reaction with 100 ng of pooled human 
genomic DNA (200 samples), obtained as described in 
Example 1, taken from samples in pool 3 or pool 4, although 
amounts ranging from 100 ng to 1 ug could be used. 
Individual DNA concentrations within the pooled samples 
were present in equal concentration with the final concen- 
tration of 0.5 ng. Each reaction contained lxPCR buffer 
(Qiagen, Valencia, Calif.), 200 uM dNTPs, 1U Hotstar Taq 
polymerase (Qiagen, Valencia, Calif.), 4 mM MgCl z , and 25 
pmols of the forward primer containing both the universal 
primer sequence and the target specific short sequence 
5-AGCAGGGCTTCCTCCTTC-3' (SEQ ID NO.: 8) 2 
pmoles of the long primer 5' -AGCGG ATAAC AATTTCA- 
CACAGGTGACCCAGCCGTACCTATTC-3' (SEQ ID 
NO.: 9) and 10 pmoles of a biotinylated universal primer 
complementary to the 5' end of the PCR amplicon 5'-AGCG- 
GATAACAATTTCACACAGG-3 1 (SEQ ID NO.: 121). 
After an initial round of amplification with the target with 
the specific forward (long) and reverse primer (short), the 5' 
biotinylated universal primer then hybridized and acted as a 
reverse primer thereby introducing a 3' biotin capture moiety 
into the molecule. The amplification protocol results in a 
5 '-biotinylated double stranded DNA amplicon and dramati- 
cally reduces the cost of high throughput genotyping by 
eliminating the need to 5' biotin label each forward primer 
used in a genotyping. Thermal cycling was performed in 0.2 
mL tubes or 96 well plate using an MJ Research Thermal 
Cycler (Watham, Mass.) (calculated temperature) with the 
following cycling parameters: 94° C. for 5 min; 45 cycles: 
94° C. for 20 sec, 56° C. for 30 sec, 72° C. for 60 sec; 72° 
C. 3 min. 

[0217] Immobilization of DNA 

[0218] The 50 fA PCR reaction was added to 25 fA of 
streptavidin coated magnetic bead (Dynal, Lake Success, 
N.Y.) prewashed three times and resuspended in 1 M NH 4 CL 
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0.06 M NH 4 OH. The PCR amplicons were allowed to bind 
to the beads for 15 minutes at room temperature. The beads 
were then collected with a magnet and the supernatant 
containing unbound DNA was removed. The unbound 
strand was released from the double stranded amplicons by 
incubation in 100 mM NaOH and washing of the beads three 
times with 10 mM Tris pH 8.0. 

[0219] Genotyping 

[0220] The frequency of the alleles at position 2577 in the 
GPI-1 gene was measured using the MassEXTEND™ assay 
and MALDI-TOF. The SNP identified at position 2577 of 
GPI-1 in the GenBank sequence is represented as a G to A 
transversion. The MassEXTEND™ assay used detected this 
sequence, thus the SNP was represented as C to T in the 
primer extension products. The DNA coated magnetic beads 
were resuspended in 26 mM Tris-HCL pH 9.5, 6.5 mM 
MgCl 2 and 50 mM each of dTTPs and 50 mM each of 
ddCTP, ddATP, ddGTP, 2.5 U of a thermostable DNA 
polymerase (Amersham Pharmacia Biotech, Piscataway, 
N.J.) and 20 pmoles of a template specific oligonucleotide 
primer 5 AAGGGAGACAG ATTTGGC-3' (SEQ ID NO.: 
10) (Operon, Alameda, Calif.). Primer extension occurred 
with three cycles of oligonucleotide primer hybridization 
and extension. The extension products were analyzed after 
denaturation from the template with 50 mM NH 4 C1 and 
transfer of 150 nl each sample to a silicon chip preloaded 
with 150 nl of H3PA matrix material. The sample material 
was allowed to crystallize and analyzed by MALDI-TOF 
(Bruker Daltonics, Billerica, Mass.; PerSeptive, Foster City, 
Calif.). The mass of the primer used in the MassEXTEND™ 
reaction was 561 2.70 daltons. The predominant allele is 


extended by the addition of ddC, which has a mass of 
5885.90 daltons. The allelic variant results in the addition of 
dT and ddG to the primer to produce an extension product 
having a mass of 6230.10 daltons. 

[0221] In addition to being analyzed as a pool, each 
individual sample (0.5 ng) was amplified as described above 
and analyzed individually using the MassEXTEND™ reac- 
tion as described above. 

[0222] Pooled populations of women (200 women per 
pool) with low HDL (pool 3) showed an increase in the T 
allele of 11.33% at nucleotide position 2577 as compared 
with those with high levels of HDL (pool 4). The association 
of this allelic variant of the GPI-1 gene with low HDL gave 
a statistically significant value of 15.04 using a 1-degree- 
of-freedom chi-squared test of association. In other words, 
the increase of 16.23% to 27.57% is significant, with a p 
value of 0.0001064 (see FIG. 2). The genotype of each of 
the individuals in the pooled population was also determined 
by carrying out individual MassEXTEND™ reactions on 
individual DNA samples. These analysis confirmed the 
pooling data showing that there was an increase in the 
frequency of the T allele of 19.49% to 26.1%, (p=0.024). 
The measured genotypes in pool 3 showed a decrease in the 
homozygous CC genotype from 65.24% to 54.21% and an 
increase in the heterozygous CT genotype from 30.51% to 
39.25%. The homozygous TT genotypes increased 2.3%. 

[0223] Since modifications will be apparent to those of 
skill in this art, it is intended that this invention be limited 
only by the scope of the appended claims. 


SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS : 122 

<210> SEQ ID NO 1 

<2 11> LENGTH: 43 9 

<212> TYPE: DNA 

<2 13> ORGAN ISM i Homo Sapien 

<22 0> FEATURE: 

<221> NAME /KEY : CDS 

<222> LOCATION: (45 J ... ( 305 ) 

<4 00> SEQUENCE: 1 

ttgagctgca ggttgaatcc ggggtgcctt taggattcag cacc atg gcg gaa gac 

Met Ala Glu Asp 


atg gag acc aaa ate aag aac tac aag ace gec cct ttt gac age cgc 
Met Glu Thr Lys lie Lys Asn Tyr Lys Thr Ala Pro Phe Asp Ser Arg 


ttc ccc aac cag aac cag act aga aac tgc tgg cag aac tac ctg gac 
Phe Pro Asn Gin Asn Gin Thr Arg Asn Cys Trp Gin Asn Tyr Leu Asp 


ttc esc cgc tgt cag aag gca atg acc get aaa gga ggc gat ate tct 
Phe His Arg Cys Gin Lys Ala Met Thr Ala Lys Gly Gly Asp lie Ser 


gtg tgc gaa tgg tac cag cgt gtg tac cag tec etc tgc ccc aca tec 
Val Cys Glu Trp Tyr Gin Arg Val Tyr Gin Ser Leu Cys Pro Thr Ser 
55 60 65 


248 
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-continued 


tgg gtc aca gac tgg gat gag caa egg get gaa ggc acg ttt ccc ggg 29 6 

Trp Val Thr Asp Trp Asp Glu Gin Arg Ala Glu Gly Thr Phe Pro Gly 
70 75 SO 

aag ate tga actggctgea tctccctttc ctctgtcctc catccttctc 345 
Lys lie * 
85 

ccaggatggt gaagggggac ctggtaccca gtgatcccca ccccaggatc ctaaatcatg 405 
acttacctgc taataaaaac tcattggaaa agtg 43 9 


<210> SEQ ID NO 2 

<211> LENGTH: 8 6 

<212> TYPE: PRT 

<213> ORGANISM: Homo Sapien 

<400> SEQUENCE: 2 

Met Ala Glu Asp Met Glu Thr Lys lie Lys Asn Tyr Lys Thr Ala Pro 

15 10 15 

Phe Asp Ser Arg Phe Pro Asn. Gin Asn Gin Thr Arg Asn Cys Trp Gin 
20 25 30 

Asn Tyr Leu Asp Phe Hie Arg Cys Gin Lys Ala Met Thr Ala Lye Gly 
35 40 45 

Gly Asp lie Ser Val Cys Glu Trp Tyr Gin Arg Val Tyr Gin Ser Leu 


Cys Pro Thr Ser Trp Val Thr Asp Trp Asp Glu Gin Arg Ala Glu Gly 
65 70 75 80 


Thr Phe Pro Gly Lys lie 
85 


<210> SEQ ID HO 3 
<211> LENGTH: 18 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : PCR Primer 
<400> SEQUENCE: 3 

aggattcagc accatggc 18 


<210> SEQ ID HO 4 
<211> LENGTH: 4 3 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : PCR Primer 
<4 0 0> SEQUENCE: 4 

ageggataac aatttcacac aggtagtctg gttctggttg ggg 4 3 


<210> SEQ ID HO 5 
<211> LENGTH: 18 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I ON : MassExtend primer 
<400> SEQUENCE: 5 

aatcaagaac tacaagac 18 


<210> SEQ ID HO 6 
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<211> LENGTH: 2 921 

<212> TYPE: DNA 

<213> ORGANISM: Homo Sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: ( 10 3 ) . . . ( 18 48 ) 

<400> SEQUENCE: 6 

cagcgagcgc cgtcgtctgc ccgggcccgc ccatcggggt cccGaacccc atccggac 

cgccgcccga gcgcgcggcc coggaagcac ccgcctcccg gc atg gtg etc aag 

Met Val Leu Lys 


gec ttc ttc ccc acg tgc tgc gte teg gcg gac age ggg ctg ctg gtg 
Ala Phe Phe Pro Thr Cys Cys Val Ser Ala Asp Ser Gly Leu Leu Val 


gga egg tgg gtg ccg gag cag age age gee gtg gtc ctg gcg gtc ctg 
Gly Arg Trp Val Pro Glu Gin Ser Ser Ala Val Val Leu Ala Val Leu 


cac ttt ccc ttc ate ccc ate cag gtc aag cag etc ctg gee cag gtg 
His Phe Pro Phe lie Pro lie Gin Val Lys Gin Leu Leu Ala Gin Val 


egg cag gec age cag gtg ggc gtg gec gtg ctg ggc acc tgg tgc cac 

Arg Gin Ala Ser Gin Val Gly Val Ala Val Leu Gly Thr Trp Cys His 


tgc egg cag gag ccc gag gag age ctg ggc egc ttc ctg gag age ctg 
Cys Arg Gin Glu Pro Glu Glu Ser Leu Gly Arg Phe Leu Glu Ser Leu 


ggt get gtc ttc ccc cat gag ccc tgg ctg egg ctg tgc egg gag aga 

Gly Ala Val Phe Pro His Glu Pro Trp Leu Arg Leu Cys Arg Glu Arg 

85 90 95 100 

ggc ggc acg ttc tgg age tgc gag gec acc cac egg caa gcg ccc act 

Gly Gly Thr Phe Trp Ser Cys Glu Ala Thr His Arg Gin Ala Pro Thr 

105 110 115 

gec ccc ggt gec cct ggt gag gae cag gtc atg etc ate ttc tat gac 

Ala Pro Gly Ala Pro Gly Glu. Asp Gin Val Met Leu lie Phe Tyr Asp 

120 125 130 

cag cgc cag gtg ttg ctg tea cag eta cac ctg ccc acc gtc ctg ccc 

Gin Arg Gin Val Leu Leu Ser Gin Leu His Leu Pro Thr Val Leu Pro 

135 140 145 

gac cgc cag get gga gec acc act gec age acg ggg ggc ctg get gec 

Asp Arg Gin Ala Gly Ala Thr Thr Ala Ser Thr Gly Gly Leu Ala Ala 

150 155 160 

gtc ttc gac acg gta gca cgc agt gag gtg etc ttc cgc agt gac cgc 

Val Phe Asp Thr Val Ala Arg Ser Glu Val Leu Phe Arg Ser Asp Arg 

165 170 175 180 

ttt gat gag ggc ccc gtg egg ctg age cac tgg cag teg gag ggc gtg 

Phe Asp Glu Gly Pro Val Arg Leu Ser His Trp Gin Ser Glu Gly Val 

185 190 195 

gag gec age ate etc gcg gag ctg gec agg cga gee teg gga ccc att 

Glu Ala Ser lie Leu Ala Glu Leu Ala Arg Arg Ala Ser Gly Pro lie 

200 205 210 

tgt etg ctg ttg gec age ctg ctg teg ctg gtc tea get gtc agt gec 

Cys Leu Leu Leu Ala Ser Leu Leu Ser Leu Val Ser Ala Val Ser Ala 

215 220 225 

tgc cga gtg tte aag etc tgg cec ctg tec ttc etc ggg age aaa etc 

Cys Arg Val Phe Lys Leu Trp Pro Leu Ser Phe Leu Gly Ser Lys Leu 

230 235 240 

tec acg tgc gaa cag etc egg cac egg ctg gag cac etc acg eta ate 
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-continued 

Ser Thr Cys Glu Gin Leu Arg His Arg Leu Glu His Leu Thr Leu lie 

245 250 255 260 

ttc agt aca egg aag gcg gag aac cct gec cag ctg atg agg aag gec 

Phe Ser Thr Arg Lys Ala Glu Asn Pro Ala Gin Leu Met Arg Lys Ala 
265 270 275 

aac acg gtg gec tct gtg ctg ctg gac gtg gec ctg ggc etc atg ctg 

Asn Thr Val Ala Ser Val Leu Leu Asp Val Ala Leu Gly Leu Met Leu 
280 285 290 

ctg tec tgg etc cac ggg aga age cgc ate ggg cat ctg gee gac gec 

Leu Ser Trp Leu His Gly Arg Ser Arg lie Gly His Leu Ala Asp Ala 

295 300 305 

etc gtt cct gtg get gac cac gtg gee gag gag etc cag cat ctg ctg 

Leu Val Pro Val Ala Asp His Val Ala Glu Glu Leu Gin His Leu Leu 

310 315 320 

cag tgg ctg atg ggt get ccc gee ggg etc aag atg aac cgt gca ctg 

Gin Trp Leu Met Gly Ala Pro Ala Gly Leu Lys Met Asn Arg Ala Leu 

325 330 335 340 

gac cag gtg ctg ggc cgc ttc ttc etc tac cac ate cac ctg tgg ate 

Asp Gin Val Leu Gly Arg Phe Phe Leu Tyr His lie His Leu Trp lie 
345 350 355 

age tac ate cac etc atg tec ccc ttc gtg gag cac ate ctt tgg cac 

Ser Tyr Tie His Leu Met Ser Pro Phe Val Glu His lie Leu Trp His 
360 365 370 

gtg ggc etc teg gec tgc ctg ggc ctg aeg gtg gec ctg tec etc etc 

Val Gly Leu Ser Ala Cys Leu Gly Leu Thr Val Ala Leu Ser Leu Leu 

375 380 385 

teg gac att ate gee etc etc ace ttc cac ate tac tgc ttt tac gtc 

Ser Asp lie lie Ala Leu Leu Thr Phe His lie Tyr Cys Phe Tyr Val 

390 395 400 

tat gga gee agg ctg tac tgc ctg aag ate cat ggc ctg tec tea ctg 

Tyr Gly Ala Arg Leu Tyr Cys Leu Lys lie His Gly Leu Ser Ser Leu 

405 410 415 420 

tgg cgt ctg ttc egg ggg aag aag tgg aac gtt ctg cgc cag cgc gtg 

Trp Arg Leu Phe Arg Gly Lys Lys Trp Asn Val Leu Arg Gin Arg Val 
425 430 435 

gac tec tgt tec tat gac ctg gac cag etg ttc ate ggg act ctg etc 

Asp Ser Cys Ser Tyr Asp Leu Asp Gin Leu Phe lie Gly Thr Leu Leu 
440 445 450 

ttc ace ate ctg etc ttc etc ctg cct ace aca gec ctg tac tac ctg 

Phe Thr lie Leu Leu Phe Leu Leu Pro Thr Thr Ala Leu Tyr Tyr Leu 

455 460 465 

gtg ttc acc ctg etc egg etc ctg gtg gtc gec gtg cag ggc ctg ate 

Val Phe Thr Leu Leu Arg Leu Leu Val Val Ala Val Gin Gly Leu lie 

470 475 480 

cat ctg ctg gtg gac etc ate aac tec ctg ccg ctg tac tea ctg ggt 

His Leu Leu Val Asp Leu lie Asn Ser Leu Pro Leu Tyr Ser Leu Gly 

485 490 495 500 

ctt egg etc tgc egg ccc tac agg ctg gcg get ggc gtg aag ttc cgt 

Leu Arg Leu Cys Arg Pro Tyr Arg Leu Ala Ala Gly Val Lys Phe Arg 
505 510 515 

gtc etc egg cac gag gec age agg ecc etc cgc etc ctg atg cag ata 

Val Leu Arg His Glu Ala Ser Arg Pro Leu Arg Leu Leu Met Gin lie 
520 525 530 

aac cca ctg ccc tac age cgc gtg gtg cac acc tac cgc etc ccc age 

Asn Pro Leu Pro Tyr Ser Arg Val Val His Thr Tyr Arg Leu Pro Ser 

535 540 545 

tgt ggc tgc cac ccc aag cac tec tgg ggc gec ctg tgc cgc aag ctg 
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Cys Gly Cys His Pro Lye His Ser Trp Gly Ala Leu Cys Arg Lys Leu 

550 555 5e0 

ttc ctt ggg gag etc ate tac ccc tgg agg cag aga ggg gac aag cag 1842 

Phe Leu Gly Glu Leu lie Tyr Pro Trp Arg Gin Arg Gly Asp Lys Gin 

565 570 575 580 

gac tga gggaactget ggctcgcctg gcaccaccac acggccacag ccagccatct 189 8 
Asp * 

gctctgccag ggtggcacca gctcagctgg cgcatgtccc gtgctttgtg gaegctgetg 195 8 

tgtgctcctg aacaeggcag gccctgctat cacaccttgg gcttggaggt cattgggagt 2018 

gagcagatgt gggggtggcc agecaggctg gccgcactcc atcactggca ctgcctgcct 2 07 8 

tgggacccgc ttcccacctg ctgcggtcac catggtggcg agcacagcaa ccccaggtgt 2 138 

ccagagcact gccccatgcc caccctgcat acccaggtcc agagggtccg tccaccacag 2 198 

cagccccagg tggagggctg gtctccctgg gggctcccca gtggctctgc cctggctgtg 225 8 

ggggtggagg gaccttgcca ggatgaaccc tccagtccca ggcaccctct agctccctca 2318 

gccgaacagc accctgcatc "tgggggattg aagcagtege tgaccoccgt ccccagcggg 2 37 8 

cccgggccct cactccctga accacacggg gtttatttgc ggatgttccc tggagaggtc 2 43 8 

gctttgtgaa gaaaccatca gcaggctgtg agcatcgcca ggctgctgtg ggggegggag 2 49 8 

cagcctcagt gtcaagggee tgcccactga cccagccgta cctattcgtc cacggtgccc 255 8 

egtagcagea ggtcctgegg ccaaatctgt ctcccttcat gggcctccca gggaaggagg 2618 

aagccctgct gtgeagacac ctctgtggcc ccccaggggt gtgageggee "tggggagggg 2 67 8 

gecg-bggeae tgaggccgaa agtgcctgcc agaeggcacg gtctggg-tgc ggg-bg-ttccc 2 73 8 

tgtgagcccg agtccgcttc aggaggggag ectgeaggtg ccggctggtg aggggatgac 2 79 8 

gcgctgtggg tgggaggagg cagcgcccat ctcagcagca ccaggactgc ctgggactcc 2 85 8 

ctggcaaccc ageacegggg aagccgtcag ctgctgtgac aataaaacct gccccgtgtc 2918 

tgg 2 92 1 

<210> SEQ ID NO 7 

<211> LENGTH: 581 

<212> TYPE: PRT 

<213> ORGANISM: Homo Sapien 

<4 0 0> SEQUENCE: 7 

Met Val Leu Lys Ala Phe Phe Pro Thr Cys Cys Val Ser Ala Asp Ser 

15 10 15 

Gly Leu Leu Val Gly Arg Trp Val Pro Glu Gin Ser Ser Ala Val Val 

20 25 30 

Leu Ala Val Leu His Phe Pro Phe lie Pro lie Gin Val Lys Gin Leu 

35 40 45 

Leu Ala Gin Val Arg Gin Ala Ser Gin Val Gly Val Ala Val Leu Gly 

50 55 60 

Thr Trp Cys His Cys Arg Gin Glu Pro Glu Glu Ser Leu Gly Arg Phe 

65 70 75 80 

Leu Glu Ser Leu Gly Ala Val Phe Pro His Glu Pro Trp Leu Arg Leu 

85 90 95 

Cys Arg Glu Arg Gly Gly Thr Phe Trp Ser Cys Glu Ala Thr His Arg 

100 105 110 

Gin Ala Pro Thr Ala Pro Gly Ala Pro Gly Glu Asp Gin Val Met Leu 
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He Phe Tyr Asp Gin Arg Gin. Veil Leu Leu Ser Gin Leu His Leu Pro 
130 135 140 

Thr Val Leu Pro Asp Arg Gin Ala Gly Ala Thr Thr Ala Ser Thr Gly 
145 150 155 160 

Gly Leu Ala Ala Val Phe Asp Thr Val Ala Arg Ser Glu Val Leu Phe 

165 170 175 

Arg Ser Asp Arg Phe Asp Glu Gly Pro Val Arg Leu Ser His Trp Gin 
180 185 190 

Ser Glu Gly Val Glu Ala Ser He Leu Ala Glu Leu Ala Arg Arg Ala 
195 200 205 

Ser Gly Pro He Cys Leu Leu Leu Ala Ser Leu Leu Ser Leu Val Ser 
210 215 220 

Ala Val Ser Ala Cys Arg Val Phe Lys Leu Trp Pro Leu Ser Phe Leu 
225 230 235 240 

Gly Ser Lys Leu Ser Thr Cys Glu Gin Leu Arg His Arg Leu Glu His 
245 250 255 

Leu Thr Leu He Phe Ser Thr Arg Lys Ala Glu Asn Pro Ala Gin Leu 
260 265 270 

Met Arg Lys Ala Asn Thr Val Ala Ser Val Leu Leu Asp Val Ala Leu 
275 280 285 

Gly Leu Met Leu Leu Ser Trp Leu His Gly Arg Ser Arg He Gly His 
290 295 300 

Leu Ala Asp Ala Leu Val Pro Val Ala Asp His Val Ala Glu Glu Leu 
305 310 315 320 

Gin His Leu Leu Gin Trp Leu Met Gly Ala Pro Ala Gly Leu Lye Met 
325 330 335 

Asn Arg Ala Leu Asp Gin Val Leu Gly Arg Phe Phe Leu Tyr His lie 
340 345 350 

His Leu Trp lie Ser Tyr lie His Leu Met Ser Pro Phe Val Glu His 
355 360 365 

lie Leu Trp His Val Gly Leu Ser Ala Cys Leu Gly Leu Thr Val Ala 
370 375 380 

Leu Ser Leu Leu Ser Asp He He Ala Leu Leu Thr Phe His He Tyr 
385 390 395 400 

Cys Phe Tyr Val Tyr Gly Ala Arg Leu Tyr Cys Leu Lys He His Gly 
405 410 415 

Leu Ser Ser Leu Trp Arg Leu Phe Arg Gly Lys Lys Trp Asn Val Leu 
420 425 430 

Arg Gin Arg Val Asp Ser Cys Ser Tyr Asp Leu Asp Gin Leu Phe He 
435 440 445 

Gly Thr Leu Leu Phe Thr He Leu Leu Phe Leu Leu Pro Thr Thr Ala 
450 455 460 

Leu Tyr Tyr Leu Val Phe Thr Leu Leu Arg Leu Leu Val Val Ala Val 
465 470 475 480 

Gin Gly Leu He His Leu Leu Val Asp Leu He Asn Ser Leu Pro Leu 
485 490 495 

Tyr Ser Leu Gly Leu Arg Leu Cys Arg Pro Tyr Arg Leu Ala Ala Gly 
500 505 510 

val Lys Phe Arg val Leu Arg His Glu Ala ser Arg Pro Leu Arg Leu 

515 520 525 
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Leu Met Gin lie Asn Pro Leu Pro Tyr Ser Arg Val Val His Thr Tyr 
530 535 540 

Arg Leu Pro Ser Cys Gly Cys His Pro Lys His Ser Trp Gly Ala Leu 
545 550 555 560 

Cys Arg Lys Leu Phe Leu Gly Glu Leu lie Tyr Pro Trp Arg Gin Arg 
565 570 575 

Gly Asp Lys Gin Asp 
5S0 


<210> SEQ ID NO 8 
<211> LENGTH: 18 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : PCR primer 
<400> SEQUENCE: 8 

agcagggctt cctccttc 18 


<210> SEQ ID HO 9 

<211> LENGTH: 43 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE : 

<223> OTHER INFORMATION : PCR primer 

<4 0 0> SEQUENCE: 9 

agcggataac aatttcacac aggtgaccca gccgtaccta ttc 4 3 


<210> SEQ ID NO 10 

<211> LENGTH: 18 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: MassExtend prijner 

<4 0 0> SEQUENCE: 10 

aagggagaca gatttggc 18 


<210> SEQ ID NO 11 

<211> LENGTH: 179 0 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: ( 13 1 ) . . . ( 16 12 ) 

<223> OTHER INFORMATION: Nucleotide sequence encoding Cholesterol ester 
transfer protein (CETP) 

<400> SEQUENCE: 11 

gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc 6 0 

tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg 12 0 

cctgataacc atg ctg get gec aca gtc ctg acc ctg gec ctg ctg ggc 16 9 

Met Leu Ala Ala Thr Val Leu Thr Leu Ala Leu Leu Gly 
1 5 10 

aat gec cat gec tgc tec aaa ggc acc teg cac gag gca ggc ate gtg 217 
Asn Ala His Ala Cys Ser Lys Gly Thr Ser His Glu Ala Gly lie Val 
15 20 25 


tgc cgc ate acc aag cct gec etc ctg gtg ttg aac cac gag act gec 
Cys Arg lie Thr Lys Pro Ala Leu Leu Val Leu Asn His Glu Thr Ala 


265 
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aag gtg ate cag ace gec ttc cag cga gec age tac cca gat ate acg 
Lys Val lie Gin Thr Ala Phe Gin Arg Ala Ser Tyr Pro Asp He Thr 


ggc gag aag gec atg atg etc ctt ggc caa gtc aag tat ggg ttg cac 
Gly Glu Lys Ala Met Met Leu Leu Gly Gin Val Lys Tyr Gly Leu His 


aac ate cag ate age cac ttg tec ate gee age age cag gtg gag ctg 
Asn He Gin He Ser His Leu Ser He Ala Ser Ser Gin Val Glu Leu 


gtg gaa gec aag tec att gat gtc tec att cag aac gtg tct gtg gtc 

Val Glu Ala Lys Ser lie Asp Val Ser He Gin Asn Val Ser Val Val 

95 100 105 

ttc aag ggg acc ctg aag tat ggc tac acc act gec tgg tgg ctg ggt 

Phe Lys Gly Thr Leu Lys Tyr Gly Tyr Thr Thr Ala Trp Trp Leu Gly 

110 115 120 125 

att gat cag tec att gac ttc gag ate gac tct gec att gac etc cag 

lie Asp Gin Ser lie Asp Phe Glu He Asp Ser Ala He Asp Leu Gin 

130 135 140 

ate aac aca cag ctg acc tgt gac tct ggt aga gtg egg acc gat gec 

lie Asn Thr Gin Leu Thr Cys Asp Ser Gly Arg Val Arg Thr Asp Ala 

145 150 155 

cct gac tgc tac ctg tct ttc cat aag ctg etc ctg cat etc caa ggg 

Pro Asp Cys Tyr Leu Ser Phe His Lys Leu Leu Leu His Leu Gin Gly 

160 165 170 

gag cga gag cct ggg tgg ate aag cag ctg ttc aca aat ttc ate tec 

Glu Arg Glu Pre Gly Trp He Lys Gin Leu Phe Thr Asn Phe He Ser 

175 180 185 

ttc acc ctg aag ctg gtc ctg aag gga cag ate tgc aaa gag ate aac 

Phe Thr Leu Lys Leu Val Leu Lys Gly Gin He Cys Lys Glu He Asn 

190 195 200 205 

gtc ate tct aac ate atg gec gat ttt gtc cag aca agg get gee age 

Val He Ser Asn He Met Ala Asp Phe Val Gin Thr Arg Ala Ala Ser 

210 215 220 

ate ctt tea gat gga gac att ggg gtg gac att tec ctg aca ggt gat 

lie Leu Ser Asp Gly Asp He Gly Val Asp He Ser Leu Thr Gly Asp 

225 230 235 

ccc gtc ate aca gee tec tac ctg gag tec cat cac aag ggt cat ttc 

Pro Val lie Thr Ala Ser Tyr Leu Glu Ser His His Lys Gly His Phe 

240 245 250 

ate tac aag aat gtc tea gag gac etc ccc etc ccc acc ttc teg ccc 

He Tyr Lys Asn Val Ser Glu Asp Leu Pro Leu Pro Thr Phe Ser Pro 

255 260 265 

aca ctg ctg ggg gac tec cgc atg ctg tac ttc tgg ttc tct gag cga 

Thr Leu Leu Gly Asp Ser Arg Met Leu Tyr Phe Trp Phe Ser Glu Arg 

270 275 230 285 

gtc ttc cac teg ctg gec aag gta get ttc cag gat ggc cgc etc atg 

Val Phe His Ser Leu Ala Lys Val Ala Phe Gin Asp Gly Arg Leu Met 

290 295 300 

etc age ctg atg gga gac gag ttc aag gca gtg ctg gag acc tgg ggc 

Leu Ser Leu Met Gly Asp Glu Phe Lys Ala Val Leu Glu Thr Trp Gly 

305 310 315 

ttc aac acc aac cag gaa ate ttc caa gag gtt gtc ggc ggc ttc ccc 

Phe Asn Thr Asn Gin Glu He Phe Gin Glu Val Val Gly Gly Phe Pro 

320 325 330 


age cag gee caa gtc acc gtc cac tgc etc aag atg ccc aag ate tec 117 7 

Ser Gin Ala Gin Val Thr Val His Cys Leu Lys Met Pro Lys He Ser 
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tgc caa aac aag gga gtc gtg gtc aat tct tea gtg atg gtg aaa ttc 1225 
Cys Gin Asn Lys Gly Val Val Val Asn Eer Ser Val Met Val Lye Phe 
350 355 360 365 

etc ttt cca cgc cca gac cag caa cat tct gta get tac aca ttt gaa 127 3 

Leu Phe Pro Arg Pro Asp Gin. Gin His Ser Val Ala Tyr Thr Phe Glu 
370 375 330 

gag gat ate gtg act acc gtc cag gec tec tat tct aag aaa aag etc 132 1 

Glu Asp lie Val Thr Thr Val Gin Ala Ser Tyr Ser Lys Lys Lys Leu 
385 390 395 

ttc tta age etc ttg gat ttc cag att aca cca aag act gtt tec aac 136 9 

Phe Leu Ser Leu Leu Asp Phe Gin lie Thr Pro Lys Thr Val Ser Asn 
400 405 410 

ttg act gag age age tec gag tec ate cag age ttc ctg cag tea atg 1417 
Leu Thr Glu Ser Ser Ser Glu Ser lie Gin Ser Phe Leu Gin Ser Met 
415 420 425 

ate acc get gtg gge ate cct gag gtc atg tct egg etc gag gta gtg 1465 
lie Thr Ala Val Gly lie Pro Glu Val Met Ser Arg Leu Glu Val Val 
430 435 440 445 

ttt aca gec etc atg aac age aaa ggc gtg age etc ttc gac ate ate 1513 
Phe Thr Ala Leu Met Asn Ser Lys Gly Val Ser Leu Phe Asp lie lie 
450 455 460 

aac cct gag att ate act cga gat ggc ttc ctg ctg ctg cag atg gac 156 1 

Asn Pre Glu lie lie Thr Arg Asp Gly Phe Leu Leu Leu Gin Met Asp 
465 470 475 

ttt ggc ttc cct gag cac ctg ctg gtg gat ttc etc cag age ttg age 160 9 

Phe Gly Phe Pro Glu His Leu Leu Val Asp Phe Leu Gin Ser Leu Ser 
480 485 490 

tag aagtctccaa ggaggtcggg atggggcttg tagcagaagg caagcaccag 1662 

gctcacagct ggaaccctgg tgtctcctcc agcgtggtgg aagttgggtt aggagtaegg 1722 

agatggagat tggctcccaa ctcctcccta tcctaaaggc ccactggcat taaagtgctg 1782 

tatecaag 1790 

<210> SEQ ID NO 12 

<211> LENGTH : 493 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<4 0 0> SEQUENCE: 12 

Met Leu Ala Ala Thr Val Leu Thr Leu Ala Leu Leu Gly Asn Ala His 
15 10 15 

Ala Cys Ser Lys Gly Thr Ser His Glu Ala Gly lie Val Cys Arg lie 
20 25 30 

Thr Lys Pro Ala Leu Leu Val Leu Asn His Glu Thr Ala Lys Val lie 
35 40 45 

Gin Thr Ala Phe Gin Arg Ala Ser Tyr Pro Asp lie Thr Gly Glu Lys 
50 55 60 

Ala Met Met Leu Leu Gly Gin Val Lys Tyr Gly Leu His Asn lie Gin 
65 70 75 80 

lie Ser His Leu Ser lie Ala Ser Ser Gin Val Glu Leu Val Glu Ala 
85 90 95 

Lys Ser lie Asp Val Ser lie Gin Asn Val Ser Val Val Phe Lys Gly 
100 105 110 


Thr Leu Lys Tyr Gly Tyr Thr Thr Ala Trp Trp Leu Gly lie Asp Gin 
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Ser lie Asp Phe Glu lie Asp Ser Ala lie Asp Leu Gin lie Asn Thr 
130 135 140 

Gin Leu Thr Cys Asp Ser Gly Arg Val Arg Thr Asp Ala Pro Asp Cys 
145 150 155 160 

Tyr Leu Ser Phe His Lys Leu Leu Leu His Leu Gin Gly Glu Arg Glu 

165 170 175 

Pro Gly Trp lie Lys Gin Leu Phe Thr Asn Phe lie Ser Phe Thr Leu 
180 185 190 

Lys Leu Val Leu Lys Gly Gin lie Cys Lys Glu lie Asn Val lie Ser 
195 200 205 

Asn lie Met Ala Asp Phe Val Gin Thr Arg Ala Ala Ser lie Leu Ser 
210 215 220 

Asp Gly Asp He Gly Val Asp He Ser Leu Thr Gly Asp Pro Val He 
225 230 235 240 

Thr Ala Ser Tyr Leu Glu Ser His His Lys Gly His Phe He Tyr Lys 
245 250 255 

Asn Val Ser Glu Asp Leu Pro Leu Pro Thr Phe Ser Pro Thr Leu Leu 
260 265 270 

Gly Asp Ser Arg Met Leu Tyr Phe Trp Phe Ser Glu Arg Val Phe Hie 
275 280 285 

Ser Leu Ala Lys Val Ala Phe Gin Asp Gly Arg Leu Met Leu Ser Leu 
290 295 300 

Met Gly Asp Glu Phe Lye Ala Val Leu Glu Thr Trp Gly Phe Asn Thr 
305 310 315 320 

Asn Gin Glu He Phe Gin Glu Val Val Gly Gly Phe Pro Ser Gin Ala 
325 330 335 

Gin Val Thr Val His Cys Leu. Lys Met Pro Lys lie Ser Cys Gin Asn 
340 345 350 

Lys Gly Val Val Val Asn Ser Ser Val Met Val Lys Phe Leu Phe Pro 
355 360 365 

Arg Pro Asp Gin Gin His Ser Val Ala Tyr Thr Phe Glu Glu Asp lie 
370 375 380 

Val Thr Thr Val Gin Ala Ser Tyr Ser Lys Lys Lys Leu Phe Leu Ser 
385 390 395 400 

Leu Leu Asp Phe Gin lie Thr Pro Lys Thr Val Ser Asn Leu Thr Glu 
405 410 415 

Ser Ser Ser Glu Ser lie Gin Ser Phe Leu Gin Ser Met He Thr Ala 
420 425 430 

Val Gly lie Pro Glu Val Met Ser Arg Leu Glu Val Val Phe Thr Ala 
435 440 445 

Leu Met Asn Ser Lys Gly Val Ser Leu Phe Asp He He Asn Pro Glu 
450 455 460 

He He Thr Arg Asp Gly Phe Leu Leu Leu Gin Met Asp Phe Gly Phe 
465 470 475 480 

Pro Glu His Leu Leu Val Asp Phe Leu Gin Ser Leu Ser 
485 490 


<210> SEQ ID NO 13 

<211> LENGTHS 35 4 9 

<212> TYPE: DHA 

<213> ORGANISM: Homo sapien 
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<2 2 0> FEATURE: 

<221> NAME/KEY : CDS 

<222> LOCATION: ( 175 )...( 1602 ) 

<223> OTHER IK FORMATION : Nucleotide sequence encoding lipoprotein lipase 
(LPL) 

<4 0 0> SEQUENCE: 13 

cccctcttcc tcctcctcaa gggaaagctg cccacttcta gctgccctgc catccccttt 6 0 

aaagggcgac ttgctcagcg ccaaaccgcg gctccagccc tctccagcct ccggctcagc 120 

cggctcatca gtcggtccgc gccttgcagc tcctccagag ggacgcgccc ogag atg 17 7 

Met 

1 

gag age aaa gec ctg etc gtg ctg act ctg gec gtg tgg etc cag agt 225 

Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gin Ser 

5 10 15 

ctg acc gec tec cgc gga ggg gtg gec gee gee gac caa aga aga gat 27 3 

Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gin Arg Arg Asp 

20 25 30 

ttt ate gac ate gaa agt aaa ttt gee eta agg acc cct gaa gac aca 32 1 

Phe He Asp He Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp Thr 

35 40 45 

get gag gac ant tgc cac etc att ccc gga gta gca gag tec gtg get 36 9 

Ala Glu Asp Thr Cys Hie Leu He Pro Gly Val Ala Glu Ser Val Ala 

50 55 60 65 

acc tgt cat ttc aat cac age age aaa acc ttc atg gtg ate cat ggc 417 

Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val He His Gly 

70 75 SO 

tgg acg gta aca gga atg tat gag agt tgg gtg cca aaa ctt gtg gee 465 

Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val Ala 

85 90 95 

gec ctg tac aag aga gaa cca gac tec aat gtc att gtg gtg gac tgg 513 

Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val lie Val Val Asp Trp 

100 105 110 

etg tea egg get cag gag cat tac cca gtg tec gcg ggc tac acc aaa 56 1 

Leu Ser Arg Ala Gin Glu His Tyr Pro Val Ser Ala Gly Tyr Thr Lys 

115 120 125 

ctg gtg gga cag gat gtg gee egg ttt ate aac tgg atg gag gag gag 60 9 

Leu Val Gly Gin Asp Val Ala Arg Phe He Asn Trp Met Glu Glu Glu 

130 135 140 145 

ttt aac tac cct ctg gac aat gtc cat etc ttg gga tac age ctt gga 65 7 

Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu Gly 

150 155 160 

gee cat get get ggc att gca gga agt ctg acc aat aag aaa gtc aac 705 

Ala His Ala Ala Gly lie Ala Gly Ser Leu Thr Asn Lys Lys Val Asn 

165 170 175 

aga att act ggc etc gat cca get gga cct aac ttt gag tat gca gaa 75 3 

Arg He Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala Glu 

180 185 190 

gec ccg agt cgt ctt tct cct gat gat gca gat ttt gta gac gtc tta 80 1 

Ala Pro Ser Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu 

195 200 205 

cac aca ttc acc aga ggg tec cct ggt cga age att gga ate cag aaa 84 9 

His Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser lie Gly He Gin Lys 

210 215 220 225 

cca gtt ggg cat gtt gac att tac ccg aat gga ggt act ttt cag cca 89 7 

Pro Val Gly His Val Asp lie Tyr Pro Asn Gly Gly Thr Phe Gin Pro 

230 235 240 
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gga tgt aac att gga gaa get ate cgc gtg att gca gag aga gga ctt 945 

Gly Cys Asn lie Gly Glu Ala lie Arg Val lie Ala Glu Arg Gly Leu 
245 250 255 

gga gat gtg gac cag eta gtg aag tgc tec oac gag cgc tec att cat 99 3 

Gly Asp Val Asp Gin Leu Val Lys Cys Ser His Glu Arg Ser lie His 
260 265 270 

etc ttc ate gac tct ctg ttg aat gaa gaa aat cca agt aag gec tac 1041 

Leu Phe lie Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala Tyr 

275 230 285 

agg tgc agt tec aag gaa gec ttt gag aaa ggg etc tgc ttg agt tgt 108 9 

Arg Cys Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser Cys 
290 295 300 305 

aga aag aac cgc tgc aac aat ctg ggc tat gag ate aat aaa gtc aga 1137 

Arg Lys Asn Arg Cys Asn Asn Leu Gly Tyr Glu lie Asn Lys Val Arg 
310 315 320 

gee aaa aga age age aaa atg tac ctg aag act cgt tct cag atg ccc 1185 

Ala Lys Arg Ser Ser Lys Met Tyr Leu Lys Thr Arg Ser Gin Met Pro 
325 330 335 

tac aaa gtc tte cat tac caa gta aag att cat ttt tct ggg act gag 123 3 

Tyr Lys Val Phe His Tyr Gin Val Lys He His Phe Ser Gly Thr Glu 
340 345 350 

agt gaa ace eat acc aat eag gee ttt gag att tct ctg tat ggc acc 128 1 

Ser Glu Thr His Thr Asn Gin Ala Phe Glu lie Ser Leu Tyr Gly Thr 

355 360 365 

gtg gec gag agt gag aac ate cca tte act ctg cct gaa gtt tec aca 132 9 

Val Ala Glu Ser Glu Asn He Pro Phe Thr Leu Pro Glu Val Ser Thr 
370 375 330 385 

aat aag acc tac tec tte eta att tac aca gag gta gat att gga gaa 137 7 

Asn Lys Thr Tyr Ser Phe Leu He Tyr Thr Glu Val Asp He Gly Glu 
390 395 400 

eta etc atg ttg aag etc aaa tgg aag agt gat tea tac ttt age tgg 1425 

Leu Leu Met Leu Lys Leu Lys Trp Lys Ser Asp Ser Tyr Phe Ser Trp 
405 410 415 

tea gac tgg tgg age agt ccc ggc tte gec att cag aag ate aga gta 147 3 

Ser Asp Trp Trp Ser Ser Pro Gly Phe Ala He Gin Lys He Arg Val 
420 425 430 

aaa gca gga gag act cag aaa aag gtg ate tte tgt tct agg gag aaa 152 1 

Lys Ala Gly Glu Thr Gin Lys Lys Val He Phe Cys Ser Arg Glu Lys 

435 440 445 

gtg tct cat ttg cag aaa gga aag gca cct gcg gta ttt gtg aaa tgc 156 9 

Val Ser His Leu Gin Lys Gly Lys Ala Pro Ala Val Phe Val Lys Cys 
450 455 4S0 465 

cat gac aag tct ctg aat aag aag tea ggc tga aactgggega atctacagaa 1622 

His Asp Lys Ser Leu Asn Lys Lys Ser Gly * 
470 475 

caaagaaegg catgtgaatt ctgtgaagaa tgaagtggag gaagtaactt ttacaaaaca 1682 

tacccagtgt ttggggtgtt tcaaaagtgg attttcctga atattaatcc cagccctacc 1742 

cttgttagtt attttaggag acagtctcaa gcactaaaaa gtggctaatt eaatttatgg 1802 

ggtatagtgg ccaaatagca catcctccaa cgttaaaaga cagtggatca tgaaaagtgc 1862 

tgttttgtcc tttgagaaag aaataattgt ttgagcgcag agtaaaataa ggctccttca 1922 

tgtggcgtat tgggceatag cntataattg gttagaaect cetattttaa ttggaattct 1982 

ggatctttcg gaetgaggee ttctcaaact ttactctaag tctccaagaa tacagaaaat 2 042 


gcttttccgc ggcacgaatc agactcatct acacagcagt atgaatgatg ttttagaatg 2 102 
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attccctctt gctattggaa tgtggtccag acgtcaacca ggaacatgta acttggagag 2 162 

ggacgaagaa agggtctgat aaacacagag gttttaaaca gtccctaccd ttggcctgca 2222 

tcatgacaaa gttacaaatt caaggagata taaaatctag atcaattaat tcttaatagg 2282 

ctttatcgtt tattgcttaa tccctctctc ccccttcttt tttgtctcaa gattatatta 2 342 

taataatgtt ctctgggtag gtgttgaaaa tgagcctgta atcctcagct gacacataat 2 402 

ttgaatggtg cagaaaaaaa aaagataccg taattttatt attagattct ccaaatgatt 2 462 

ttcatcaatt taaaatcatt caatatctga cagttantct tcagttttag gcttaccttg 2522 

gtcatgcttc agttgtactt ccagtgcgtc tcttttgttc ctggctttga catgaaaaga 2582 

taggtttgag ttcaaatttt gcattgtgtg agcttctaca gattttagac aaggaccgtt 2 642 

tttactaagt aaaagggtgg agaggttcct ggggtggatt cctaagcagt gcttgtaaac 2 702 

catcgcgtgc aatgagccag atggagtacc atgagggttg ttatttgttg tttttaacaa 2 762 

ctaatcaaga gtgagtgaao aactatttat aaactagatc tcctattttt cagaatgctc 2 822 

ttctacgtat aaatatgaaa tgataaagat gtcaaatatc tcagaggcta tagctgggaa 2 882 

cccgactgtg aaagtatgtg atatctgaac acatactaga aagctctgca t gtgtgttgt 2 942 

GGttcagcat aattcggaag ggaaaacagt cgatcaaggg atgtattgga acatgtcgga 3 002 

gtagaaattg ttcctgatgt gccagaactt cgaccctttc tctgagagag atgatcgtgc 3 062 

ctataaatag taggaccaat gttgtgatta acatcatcag gcttggaatg aattctctct 3 122 

aaaaataaaa tgatgtatga tttgttgttg gcatcccctt tattaattca ttaaatttct 3 182 

ggatttgggt tgtgacccag ggtgcattaa cttaaaagat tcactaaagc agcacatagc 3242 

actgggaact ctggctccga aaaactttgt tatatatatc aaggatgttc tggctttaca 3 302 

ttttatttat tagctgtaaa tacatgtgtg gatgtgtaaa tggagcttgt acatattgga 3 362 

aaggtcattg tggctatctg catttataaa tgtgtggtgc taactgtatg tgtctttatc 3 42 2 

agtgatggtc tcacagagcc aactcactct tatgaaatgg gctttaacaa aacaagaaag 3 482 

aaacgtactt aactgtgtga agaaatggaa tcagctttta ataaaattga caacatttta 3542 

ttaccac 3549 

<210> SEQ ID HO 14 

<211> LENGTH: 4 75 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<4 0 0> SEQUENCE : 14 

Met Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gin 

15 10 15 

Ser Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gin Arg Arg 
20 25 30 

Asp Phe lie Asp lie Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp 
35 40 45 

Thr Ala Glu Asp Thr Cys His Leu lie Pro Gly Val Ala Glu Ser Val 
50 55 60 

Ala Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val lie His 
65 70 75 80 

Gly Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val 
85 90 95 
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Ala Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val lie Val Val Asp 
100 105 110 

Trp Leu Ser Arg Ala Gin Glu His Tyr Pro Val Ser Ala Gly Tyr Thr 
115 120 125 

Lys Leu Val Gly Gin Asp Val Ala Arg Phe He Asn Trp Met Glu Glu 
130 135 140 

Glu Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu 
145 150 155 160 

Gly Ala His Ala Ala Gly lie Ala Gly Ser Leu Thr Asn Lys Lys Val 
165 170 175 

Asn Arg lie Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala 
ISO 185 190 

Glu Ala Pro Ser Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val 
195 200 205 

Leu His Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser He Gly lie Gin 

210 215 220 

Lys Pro Val Gly His Val Asp He Tyr Pro Asn Gly Gly Thr Phe Gin 
225 230 235 240 

Pro Gly Cys Asn lie Gly Glu Ala He Arg Val lie Ala Glu Arg Gly 

245 250 255 

Leu Gly Asp Val Asp Gin Leu Val Lys Cys Ser His Glu Arg Ser He 
260 265 270 

His Leu Phe He Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala 
275 2S0 235 

Tyr Arg Cys Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser 
290 295 300 

Cys Arg Lys Asn Arg Cys Asn Asn Leu Gly Tyr Glu He Asn Lys Val 
305 310 315 320 

Arg Ala Lys Arg Ser Ser Lys Met Tyr Leu Lys Thr Arg Ser Gin Met 
325 330 335 

Pro Tyr Lys Val Phe His Tyr Gin Val Lys He His Phe Ser Gly Thr 
340 345 350 

Glu Ser Glu Thr His Thr Asn Gin Ala Phe Glu lie Ser Leu Tyr Gly 
355 360 365 

Thr Val Ala Glu Ser Glu Abu He Pro Phe Thr Leu Pro Glu Val Ser 
370 375 380 

Thr Asn Lys Thr Tyr Ser Phe Leu He Tyr Thr Glu Val Asp He Gly 
385 390 395 400 

Glu Leu Leu Met Leu Lys Leu Lys Trp Lys Ser Asp Ser Tyr Phe Ser 
405 410 415 

Trp Ser Asp Trp Trp Ser Ser Pro Gly Phe Ala lie Gin Lys He Arg 
420 425 430 

Val Lys Ala Gly Glu Thr Gin Lys Lys Val He Phe Cys Ser Arg Glu 

435 440 445 

Lys Val Ser His Leu Gin Lys Gly Lys Ala Pro Ala Val Phe Val Lys 
450 455 460 

Cys His Asp Lys Ser Leu Asn Lys Lys Ser Gly 
465 470 475 


<210> SEQ ID HO 15 
<211> LEMGTH: 146 6 
<212> TYPE: DKA 
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<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY : CDS 

<222> LOCATION: ( 115 )...( 1305 ) 

<223> OTHER IK FORMAT I OK : Nucleotide sequence encoding apolipoprotein 
A— IV (AEOA4) 

<400> SEQUENCE: 15 

agttcccact gcagcgcagg tgagctctcc tgaggacctc tctgtcagct cccctgattg 6 0 

tagggaggca tccagtgtgg caagaaactc ctccagccca gcaagcagct cagg atg 117 

Met 


ttc ctg aag gcc gtg gtc ctg acc ctg gec ctg gtg get gtc gec gga 
Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val Ala Gly 


gec agg get gag gtc agt get gac cag gtg gee aca gtg atg tgg gac 
Ala Arg Ala Glu Val Ser Ala Asp Gin Val Ala Thr Val Met Trp Asp 


tac ttc age cag ctg age aac aat gec aag gag gec gtg gaa cat etc 26 1 

Tyr Phe Ser Gin Leu Ser Asn Asn Ala Lys Glu Ala Val Glu His Leu 
35 40 45 


cag aaa tct gaa etc acc cag caa etc aat gec etc ttc cag gac aaa 30 9 

Gin Lys Ser Glu Leu Thr Gin Gin Leu Asn Ala Leu Phe Gin Asp Lys 
50 55 SO 65 


ctt gga gaa gtg aac act tac gca ggt gac ctg cag aag aag ctg gtg 
Leu Gly Glu Val Asn Thr Tyr Ala Gly Asp Leu Gin Lys Lys Leu Val 


ccc ttt gec acc gag ctg cat gaa cgc ctg gec aag gac teg gag aaa 
Pro Phe Ala Thr Glu Leu His Glu Arg Leu Ala Lye Asp Ser Glu Lys 


ctg aag gag gag att ggg aag gag ctg gag gag ctg agg gec egg ctg 
Leu Lys Glu Glu lie Gly Lys Glu Leu Glu Glu Leu Arg Ala Arg Leu 
100 105 110 


ctg ccc cat gec aat gag gtg age cag aag ate ggg gac aac ctg cga 
Leu Pre His Ala Asn Glu Val Ser Gin Lys lie Gly Asp Asn Leu Arg 
115 120 125 


gag ctt cag cag cgc ctg gag ccc tac gcg gac cag ctg cgc acc cag 54 9 

Glu Leu Gin Gin Arg Leu Glu Pro Tyr Ala Asp Gin Leu Arg Thr Gin 
130 135 140 145 


gtc aac acg cag gec gag cag ctg egg cgc cag ctg acc ccc tac gca 
Val Asn Thr Gin Ala Glu Gin Leu Arg Arg Gin Leu Thr Pro Tyr Ala 
150 155 160 


cag cgc atg gag aga gtg ctg egg gag aac gec gac age ctg cag gec 
Gin Arg Met Glu Arg Val Leu Arg Glu Asn Ala Asp Ser Leu Gin Ala 
165 170 175 


teg ctg agg ccc cac gec gac gag etc aag gec aag ate gac cag aac 69 3 

Ser Leu Arg Pro His Ala Asp Glu Leu Lys Ala Lys lie Asp Gin Asn 
180 185 190 


gtg gag gag etc aag gga cgc ctt acg ccc tac get gac gaa ttc aaa 74 1 

Val Glu Glu Leu Lys Gly Arg Leu Thr Pro Tyr Ala Asp Glu Phe Lys 
195 200 205 


gtc aag att gac cag acc gtg gag gag ctg cgc cgc age ctg get ccc 78 9 

Val Lys He Asp Gin Thr Val Glu Glu Leu Arg Arg Ser Leu Ala Pro 
210 215 220 225 


tat get cag gac acg cag gag aag etc aac cac cag ctt gag ggc ctg 
Tyr Ala Gin Asp Thr Gin Glu Lys Leu Asn His Gin Leu Glu Gly Leu 

230 235 240 


acc ttc cag atg aag aag aac gec gag gag etc aag gec agg ate teg 


885 
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Thr Phe Gin Met Lys Lye Abu Ala Glu Glu Leu Lys Ala Arg lie Ser 

245 250 255 

gcc agt gcc gag gag ctg egg cag agg ctg gcg ccc ttg gec gag gac 93 3 

Ala Ser Ala Glu Glu Leu Arg Gin Arg Leu Ala Pro Leu Ala Glu Asp 
260 265 270 

gtg cgt ggc aac ctg agg ggc aac acc gag ggg ctg cag aag tea ctg 98 1 

Val Arg Gly Asn Leu Arg Gly Asn Thr Glu Gly Leu Gin Lys Ser Leu 
275 280 285 

gca gag ctg ggt ggg cac ctg gac cag cag gtg gag gag ttc cga cgc 102 9 

Ala Glu Leu Gly Gly His Leu Asp Gin Gin Val Glu Glu Phe Arg Arg 
290 295 300 305 

egg gtg gag ccc tac ggg gaa aac ttc aac aaa gcc ctg gtg cag cag 107 7 

Arg Val Glu Pro Tyr Gly Glu Asn Phe Asn Lys Ala Leu Val Gin Gin 
310 315 320 

atg gaa cag etc agg acg aaa ctg ggc ccc cat gcg ggg gac gtg gaa 1125 
Met Glu Gin Leu Arg Thr Lys Leu Gly Pro His Ala Gly Asp Val Glu 
325 330 335 

ggc cac ttg age ttc ctg gag aag gac ctg agg gac aag gtc aac tec 1173 
Gly His Leu Ser Phe Leu Glu Lys Asp Leu Arg Asp Lys Val Asn Ser 
340 345 350 

ttc ttc age acc ttc aag gag aaa gag age cag gac aag act etc tec 122 1 

Phe Phe Ser Thr Phe Lys Glu Lys Glu Ser Gin Asp Lys Thr Leu Ser 
355 360 365 

etc cat gag ctg gag naa cag cag gaa cag cat cag gag cag cag cag 1269 
Leu Pro Glu Leu Glu Gin Gin Gin Glu Gin His Gin Glu Gin Gin Gin 
370 375 3S0 385 

gag cag gtg cag atg ctg gcc cct ttg gag age tga gctgcccctg 1315 
Glu Gin Val Gin Met Leu Ala Pro Leu Glu Ser * 
390 395 

gtgcactggc cccaccctcg tggacacctg ccctgccctg ccacctgtct gtctgtccca 1375 

aagaagttct ggtatgaact tgaggacaca tgtccagtgg gaggtgagac cacctotcaa 1435 

tattcaataa agetgetgag aatctageet c 146 6 

<210> SEQ ID HO 16 

<211> LENGTH: 396 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<4 0 0> SEQUENCE: 16 

Met Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val Ala 

15 10 15 

Gly Ala Arg Ala Glu Val Ser Ala Asp Gin Val Ala Thr Val Met Trp 
20 25 30 

Asp Tyr Phe Ser Gin Leu Ser Asn Asn Ala Lys Glu Ala Val Glu His 
35 40 45 

Leu Gin Lys Ser Glu Leu Thr Gin Gin Leu Asn Ala Leu Phe Gin Asp 
50 55 60 

Lys Leu Gly Glu Val Asn Thr Tyr Ala Gly Asp Leu Gin Lys Lys Leu 
65 70 75 80 

Val Pro Phe Ala Thr Glu Leu His Glu Arg Leu Ala Lys Asp Ser Glu 
85 90 95 

Lys Leu Lys Glu Glu lie Gly Lys Glu Leu Glu Glu Leu Arg Ala Arg 
100 105 110 


Leu Leu Pro His Ala Asn Glu val ser Gin Lys lie Gly Asp Asn Leu 
115 120 125 


US 2003/0224418 Al Dec. 4, 2003 

40 


-continued 


Arg Glu Leu. Gin Gin Arg Leu Glu Pro Tyr Ala Asp Gin Leu Arg Thr 
130 135 140 

Gin Val Asn Thr Gin Ala Glu Gin Leu Arg Arg Gin Leu Thr Pro Tyr 
145 150 155 160 

Ala Gin Arg Met Glu Arg Val Leu Arg Glu Asn Ala Asp Ser Leu Gin 
165 170 175 

Ala Ser Leu Arg Pro Hie Ala Asp Glu Leu Lys Ala Lys lie Asp Gin 
ISO 185 190 

Asn Val Glu Glu Leu Lys Gly Arg Leu Thr Pro Tyr Ala Asp Glu Phe 
195 200 205 

Lys Val Lys lie Asp Gin Thr Val Glu Glu Leu Arg Arg Ser Leu Ala 
210 215 220 

Pro Tyr Ala Gin Asp Thr Gin Glu Lys Leu Asn His Gin Leu Glu Gly 
225 230 235 240 

Leu Thr Phe Gin Met Lys Lys Asn Ala Glu Glu Leu Lys Ala Arg lie 
245 250 255 

Ser Ala Ser Ala Glu Glu Leu Arg Gin Arg Leu Ala Pro Leu Ala Glu 
260 265 270 

Asp Val Arg Gly Asn Leu Arg Gly Asn Thr Glu Gly Leu Gin Lys Ser 
275 280 285 

Leu Ala Glu Leu Gly Gly His Leu Asp Gin Gin Val Glu Glu Phe Arg 
290 295 300 

Arg Arg Val Glu Pro Tyr Gly Glu Asn Phe Asn Lys Ala Leu Val Gin 
305 310 315 320 

Gin Met Glu Gin Leu Arg Thr Lys Leu Gly Pro His Ala Gly Asp Val 
325 330 335 

Glu Gly His Leu Ser Phe Leu Glu Lys Asp Leu Arg Asp Lys Val Asn 
340 345 350 

Ser Phe Phe Ser Thr Phe Lys Glu Lys Glu Ser Gin Asp Lys Thr Leu 
355 360 365 

Ser Leu Pro Glu Leu Glu Gin Gin Gin Glu Gin His Gin Glu Gin Gin 

370 375 380 

Gin Glu Gin Val Gin Met Leu Ala Pro Leu Glu Ser 
385 390 395 


<210> SEQ ID HO 17 

<211> LENGTH : 115 6 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (61) (1014) 

<223> OTHER INFORMATION: Nucleotide Sequence encoding apolipoprotein : 
(APOE) 

<400> SEQUENCE: 17 

cgcagcggag gtgaaggacg tccttcccca ggagccgact ggccaatcac aggcaggaag 

atg aag gtt ctg tgg get gcg ttg ctg gtc aca ttc ctg gca gga tgc 
Met Lys Val Leu Trp Ala Ala Leu Leu Val Thr Phe Leu Ala Gly Cys 
15 10 15 

cag gec aag gtg gag caa gcg gtg gag aca gag ccg gag ccc gag ctg 
Gin Ala Lys Val Glu Gin Ala Val Glu Thr Glu Pro Glu Pro Glu Leu 


cgc cag cag acc gag tgg cag age ggc cag cgc tgg gaa ctg gca ctg 


204 
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Arg Gin Gin Thr Glu Trp Gin Ser Gly Gin Arg Trp Glu Leu Ala Leu 

35 40 45 

ggt cgc ttt tgg gat tac ctg cgc tgg gtg cag aca ctg tct gag cag 252 

Gly Arg Ehe Trp Asp Tyr Leu Arg Trp Val Gin Thr Leu Ser Glu Gin 

50 55 60 

gtg cag gag gag ctg etc age tec cag gtc acc cag gaa ctg agg gcg 30 0 

Val Gin Glu Glu Leu Leu Ser Ser Gin Val Thr Gin Glu Leu Arg Ala 

65 70 75 80 

ctg atg gac gag acc atg aag gag ttg aag gec tac aaa teg gaa ctg 34 8 

Leu Met Asp Glu Thr Met Lys Glu Leu Lys Ala Tyr Lye Ser Glu Leu 

85 90 95 

gag gaa caa ctg acc ccg gtg gcg gag gag acg egg gca egg ctg tec 39 6 

Glu Glu Gin Leu Thr Pro Val Ala Glu Glu Thr Arg Ala Arg Leu Ser 

100 105 110 

aag gag ctg cag gcg gcg cag gec egg ctg ggc gcg gac atg gag gac 44 4 

Lye Glu Leu Gin Ala Ala Gin Ala Arg Leu Gly Ala Asp Met Glu Asp 

115 120 125 

gtg tgc ggc cgc ctg gtg cag tac cgc ggc gag gtg cag gec atg etc 492 

Val Cys Gly Arg Leu Val Gin Tyr Arg Gly Glu Val Gin Ala Met Leu 

130 135 140 

ggc cag age acc gag gag ctg egg gtg cgc etc gee tec cac ctg cgc 54 0 

Gly Gin Ser Thr Glu Glu Lsu Arg Val Arg Leu Ala Ser His Leu Arg 

145 150 155 160 

aag ctg cgt aag egg etc etc cgc gat gec gat gac ctg cag aag cgc 58 8 

Lys Leu Arg Lys Arg Leu Leu Arg Asp Ala Asp Asp Leu Gin Lys Arg 

165 170 175 

ctg gca gtg tac cag gec ggg gec cgc gag ggc gee gag cgc ggc etc 63 6 

Leu Ala Val Tyr Gin Ala Gly Ala Arg Glu Gly Ala Glu Arg Gly Leu 

ISO 185 190 

age gee ate cgc gag cgc ctg ggg ccc ctg gtg gaa cag ggc cgc gtg 68 4 

Ser Ala lie Arg Glu Arg Leu Gly Pro Leu Val Glu Gin Gly Arg Val 

195 200 205 

egg gec gee act gtg ggc tec ctg gec ggc cag ccg eta cag gag egg 732 

Arg Ala Ala Thr Val Gly Ser Leu Ala Gly Gin Pro Leu Gin Glu Arg 

210 215 220 

gee cag gee tgg ggc gag egg ctg cgc gcg egg atg gag gag atg ggc 78 0 

Ala Gin Ala Trp Gly Glu Arg Leu Arg Ala Arg Met Glu Glu Met Gly 

225 230 235 240 

age egg acc cgc gac cgc ctg gac gag gtg aag gag cag gtg gcg gag 82 8 

Ser Arg Thr Arg Asp Arg Leu Asp Glu Val Lys Glu Gin Val Ala Glu 

245 250 255 

gtg cgc gec aag ctg gag gag cag gee cag cag ata cgc ctg cag gec 876 

Val Arg Ala Lys Leu Glu Glu Gin Ala Gin Gin lie Arg Leu Gin Ala 

260 265 270 

gag gec ttc cag gec cgc etc aag age tgg ttc gag ccc ctg gtg gaa 92 4 

Glu Ala Phe Gin Ala Arg Leu Lys Ser Trp Phe Glu Pro Leu Val Glu 

275 280 285 

gac atg cag cgc cag tgg gee ggg ctg gtg gag aag gtg cag get gee 972 

Asp Met Gin Arg Gin Trp Ala Gly Leu Val Glu Lys Val Gin Ala Ala 

290 295 300 

gtg ggc acc age gec gec cct gtg ccc age gac aat cac tga 1014 

Val Gly Thr Ser Ala Ala Pro Val Pro Ser Asp Asn His * 

305 310 315 

acgccgaagc ctgcagccat gcgaccccac gccaccccgt gcctcctgcc tccgcgcagc 1074 

ctgcagcggg agaccctgtc cccgccccag ccgtcctcct ggggtggacc ctagtttaat 1134 

aaagattcac caagtttcac gc 115 6 
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<210> SEQ ID HO 18 

<211> LENGTH: 317 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<4 0 0> SEQUENCE: 13 

Met Lys Val Leu Trp Ala Ala Leu Leu Val Thr Phe Leu Ala Gly Cys 
15 10 15 

Gin Ala Lys Val Glu Gin Ala Val Glu Thr Glu Pro Glu Pro Glu Leu 
20 25 30 

Arg Gin Gin Thr Glu Trp Gin Ser Gly Gin Arg Trp Glu Leu Ala Leu 
35 40 45 

Gly Arg Phe Trp Asp Tyr Leu Arg Trp Val Gin Thr Leu Ser Glu Gin 
50 55 60 

Val Gin Glu Glu Leu Leu Ser Ser Gin Val Thr Gin Glu Leu Arg Ala 


Leu Met Asp Glu Thr Met Lys Glu Leu Lys Ala Tyr Lys Ser Glu Leu 
85 90 95 

Glu Glu Gin Leu Thr Pro Val Ala Glu Glu Thr Arg Ala Arg Leu Ser 
100 105 110 

Lye Glu Leu Gin Ala Ala Gin Ala Arg Leu Gly Ala Asp Met Glu Asp 
115 120 125 

Val Cys Gly Arg Leu Val Gin Tyr Arg Gly Glu Val Gin Ala Met Leu 
130 135 140 

Gly Gin Ser Thr Glu Glu Leu Arg Val Arg Leu Ala Ser His Leu Arg 
145 150 155 160 

Lys Leu Arg Lys Arg Leu Leu Arg Asp Ala Asp Asp Leu Gin Lys Arg 
165 170 175 

Leu Ala Val Tyr Gin Ala Gly Ala Arg Glu Gly Ala Glu Arg Gly Leu 
ISO 185 190 

Ser Ala lie Arg Glu Arg Leu Gly Pro Leu Val Glu Gin Gly Arg Val 
195 200 205 

Arg Ala Ala Thr Val Gly Ser Leu Ala Gly Gin Pro Leu Gin Glu Arg 
210 215 220 

Ala Gin Ala Trp Gly Glu Arg Leu Arg Ala Arg Met Glu Glu Met Gly 
225 230 235 240 

Ser Arg Thr Arg Asp Arg Leu Asp Glu Val Lys Glu Gin Val Ala Glu 
245 250 255 

Val Arg Ala Lys Leu Glu Glu Gin Ala Gin Gin lie Arg Leu Gin Ala 
260 265 270 

Glu Ala Phe Gin Ala Arg Leu Lys Ser Trp Phe Glu Pro Leu Val Glu 
275 280 285 

Asp Met Gin Arg Gin Trp Ala Gly Leu Val Glu Lys Val Gin Ala Ala 
290 295 300 

Val Gly Thr Ser Ala Ala Pro Val Pro Ser Asp Asn His 
305 310 315 


<210> SEQ ID HO 19 

<211> LENGTH: 160 3 

<212> TYPE: DHA 

<213> ORGANISMS Homo sapien 

<2 2 0> FEATURE: 

<221> WAME/KEY: CDS 
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<222> LOCATION: (58 ) (155 7) 

<223> OTHER INFORMATION : nucleotide sequence encoding hepatic lipase 
(LIPC) 

<4 0 0> SEQUENCE: 19 

ggtctctttg gcttaagaaa ttaccaagaa agcctggacc ccgggtgaaa cggagaa atg 

Met 


gac aca agt ccc ctg tgt ttc tec att ctg ttg gtt tta tgc ate ttt 
Asp Thr Ser Pro Leu Cys Phe Ser lie Leu Leu Val Leu Cys lie Phe 


ate caa tea agt gee ctt gga caa age ctg aaa cca gag cca ttt gga 

lie Gin Ser Ser Ala Leu Gly Gin Ser Leu Lys Pro Glu Pro Phe Gly 


aga aga get caa get gtt gaa aca aac aaa acg ctg cat gag atg aag 
Arg Arg Ala Gin Ala Val Glu Thr Asn Lys Thr Leu His Glu Met Lys 


acc aga ttc ctg etc ttt gga gaa ace aat cag ggc tgt cag att cga 
Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gin Gly Cys Gin lie Arg 


ate aat cat ccg gac acg tta cag gag tgc ggc ttc aac tec tec ctg 
lie Asn His Pro Asp Thr Leu Gin Glu Cys Gly Phe Asn Ser Ser Leu 
70 75 80 

cct ctg gtg atg ata ate cac ggg tgg teg gtg gac ggc gtg eta gaa 

Pro Leu Val Met He He His Gly Trp Ser Val Asp Gly Val Leu Glu 


aac tgg ate tgg cag atg gtg gec gcg ctg aag tct cag ccg gee cag 

Asn Trp He Trp Gin Met Val Ala Ala Leu Lys Ser Gin Pro Ala Gin 

100 105 110 

cca gtg aac gtg ggg ctg gtg gac tgg ate acc ctg gec cac gac cac 

Pro Val Asn Val Gly Leu Val Asp Trp He Thr Leu Ala His Asp His 

115 120 125 

tac acc ate gee gtc cgc aac acc cgc ctt gtg ggc aag gag gtc gcg 

Tyr Thr He Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val Ala 

130 135 140 145 

get ctt etc egg tgg ctg gag gaa tct gtt caa etc tct cga age cat 

Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gin Leu Ser Arg Ser His 

150 155 160 

gtt cac eta att ggg tac age ctg ggt gca cac gtg tea gga ttt gee 

Val His Leu He Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe Ala 

165 170 175 

ggc agt tec ate ggt gga acg cac aag att ggg aga ate aca ggg ctg 

Gly Ser Ser He Gly Gly Thr His Lys He Gly Arg He Thr Gly Leu 

180 185 190 

gat gec gcg gga cct ttg ttt gag gga agt gec ccc age aat cgt ctt 

Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg Leu 

195 200 205 

tct cca gat gat gec aat ttt gtg gat gee att cat acc ttt acg egg 

Ser Pro Asp Asp Ala Asn Phe Val Asp Ala He His Thr Phe Thr Arg 

210 215 220 225 

gag cac atg ggc ctg age gtg ggc ate aaa cag ccc ata gga cac tat 

Glu His Met Gly Leu Ser Val Gly He Lys Gin Pro lie Gly His Tyr 

230 235 240 

gac ttc tat ccc aac ggg ggc tec ttc cag cct ggc tgc cac ttc eta 

Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gin Pro Gly Cys His Phe Leu 

245 250 255 

gag etc tac aga cat att gee cag cac ggc ttc aat gee ate acc cag 

Glu Leu Tyr Arg His He Ala Gin His Gly Phe Asn Ala He Thr Gin 
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acc ata aaa tgc tec cac gag cga teg gtg cac ctt ttc ate gac tec 

Thr He Lys Cys Ser His Glu Arg Ser Val His Leu Phe He Asp Eer 

275 280 285 

ttg ctg cac gec ggc acg cag age atg gec tac ccg tgt ggt gac atg 

Leu Leu His Ala Gly Thr Gin. Ser Met Ala Tyr Pro Cys Gly Asp Met 

290 295 300 305 

aac age ttc age cag ggc ctg tgc ctg age tgc aag aag ggc cgc tgc 

Asn Ser Phe Ser Gin Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg Cys 

310 315 320 

aac acg ctg ggc tac cac gtc cgc cag gag ccg egg age aag age aag 

Asn Thr Leu Gly Tyr His Val Arg Gin Glu Pro Arg Ser Lys Ser Lys 

325 330 335 

agg etc ttc etc gta acg cga gee cag tec ccc ttc aaa gtt tat cat 

Arg Leu Phe Leu Val Thr Arg Ala Gin Ser Pro Phe Lys Val Tyr His 

340 345 350 

tac eag tta aag ate cag ttc ate aac caa act gag acg cca ata caa 

Tyr Gin Leu Lye lie Gin Phe He Aen Gin Thr Glu Thr Pro lie Gin 

355 360 365 

aca act ttt acc atg tea eta etc gga aca aaa gag aaa atg cag aaa 

Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gin Lys 

370 375 3S0 385 

att ccc ate act ctg ggc aaa gga att get agt aat aaa acg tat tec 

He Pre He Thr Leu Gly Lys Gly He Ala Ser Asn Lys Thr Tyr Ser 

390 395 400 

ttt ctt ate acg ctg gat gtg gat ate ggc gag ctg ate atg ate aag 

Phe Leu He Thr Leu Asp Val Asp He Gly Glu Leu He Met He Lys 

405 410 415 

ttc aag tgg tjaa aac agt gca gtg tgg gec aat gtc tgg gac acg gtc 

Phe Lys Trp Glu Asn Ser Ala Val Trp Ala Asn Val Trp Asp Thr Val 

420 425 430 

cag acc ate ate cca tgg age aca ggg ccg cgc cac tea ggc etc gtt 

Gin Thr He He Pro Trp Ser Thr Gly Pro Arg His Ser Gly Leu Val 

435 440 445 

ctg aag acg ate aga gtc aaa gca gga gaa acc cag caa aga atg aca 

Leu Lys Thr He Arg Val Lys Ala Gly Glu Thr Gin Gin Arg Met Thr 

450 455 4S0 465 

ttt tgt tea gaa aae aca gat gac eta eta ctt cgc cca acc cag gaa 

Phe Cys Ser Glu Asn Thr Asp Asp Leu Leu Leu Arg Pre Thr Gin Glu 

470 475 480 

aaa ate ttc gtg aaa tgt gaa ata aag tct aaa aca tea aag cga aag 

Lys He Phe Val Lys Cys Glu He Lys Ser Lys Thr Ser Lys Arg Lys 

485 490 495 

ate aga tga gatttaatga agacccagtg taaagaataa atgaatctta 
He Arg * 


<210> SEQ ID HO 2 0 

<211> LENGTH: 499 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<400> SEQUENCE : 2 0 

Met Asp Thr Ser Pro Leu Cys Phe 
1 5 

Phe He Gin ser ser Ala Leu Gly 

20 


Ser He Leu Leu Val Leu Cys He 
10 15 

Gin ser Leu Lys Pro Glu Pro Phe 

25 30 
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Gly Arg Arg Ala Gin Ala Val Glu Thr Asn Lys Thr Leu His Glu Met 
35 40 45 

Lys Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gin Gly Cys Gin lie 
50 55 60 

Arg lie Asn His Pro Asp Thr Leu Gin Glu Cys Gly Phe Asn Ser Ser 
65 70 75 80 

Leu Pro Leu Val Met lie lie His Gly Trp Ser Val Asp Gly Val Leu 
85 90 95 

Glu Asn Trp He Trp Gin Met Val Ala Ala Leu Lys Ser Gin Pro Ala 
100 105 110 

Gin Pro Val Asn Val Gly Leu Val Asp Trp He Thr Leu Ala Hie Asp 
115 120 125 

His Tyr Thr He Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val 
130 135 140 

Ala Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gin Leu Ser Arg Ser 
145 150 155 160 

His Val His Leu lie Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe 
165 170 175 

Ala Gly Ser Ser Tie Gly Gly Thr His Lys lie Gly Arg Tie Thr Gly 
ISO 185 190 

Leu Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg 
195 200 205 

Leu Ser Pro Asp Asp Ala Asn Phe Val Asp Ala lie His Thr Phe Thr 
210 215 220 

Arg Glu His Met Gly Leu Ser Val Gly He Lys Gin Pro He Gly His 
225 230 235 240 

Tyr Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gin Pro Gly Cys His Phe 

245 250 255 

Leu Glu Leu Tyr Arg His lie Ala Gin His Gly Phe Asn Ala He Thr 
260 265 270 

Gin Thr lie Lys Cys Ser His Glu Arg Ser Val His Leu Phe He Asp 

275 280 285 

Ser Leu Leu His Ala Gly Thr Gin Ser Met Ala Tyr Pro Cys Gly Asp 

290 295 300 

Met Asn Ser Phe Ser Gin Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg 
305 310 315 320 

Cys Asn Thr Leu Gly Tyr His Val Arg Gin Glu Pro Arg Ser Lys Ser 

325 330 335 

Lys Arg Leu Phe Leu Val Thr Arg Ala Gin Ser Pro Phe Lys Val Tyr 
340 345 350 

His Tyr Gin Leu Lys He Gin Phe lie Asn Gin Thr Glu Thr Pro lie 
355 360 365 

Gin Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gin 
370 375 380 

Lys He Pro He Thr Leu Gly Lys Gly He Ala Ser Asn Lys Thr Tyr 
385 390 395 400 

Ser Phe Leu He Thr Leu Asp Val Asp Tie Gly Glu Leu Tie Met Tie 
405 410 415 

Lys Phe Lys Trp Glu Asn Ser Ala Val Trp Ala Asn Val Trp Asp Thr 
420 425 430 
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Val Gin Thr lie lie Pro Trp Ser Thr Gly Pro Arg His Ser Gly Leu 
435 440 445 

Val Leu Lys Thr He Arg Val Lys Ala Gly Glu Thr Gin Gin Arg Met 

450 455 4e0 

Thr Phe Cys Ser Glu Asn Thr Asp Asp Leu Leu Leu Arg Pro Thr Gin 

465 470 475 480 

Glu Lys lie Phe Val Lye Cys Glu lie Lys Ser Lys Thr Ser Lys Arg 

485 490 495 


Lys lie Arg 


<210> SEQ ID HO 21 

<211> LENGTH : 134 6 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY : CDS 

<222> LOCATION: (10 ) (1077 ) 

<223> OTHER INFORMATION : Nucleotide sequence encoding paraoxcnase 
1 (PON1) 


<4 0 0> SEQUENCE: 21 


cccccgacc atg gcg aag ctg att gcg etc acc etc ttg ggg atg gga ctg 5 1 

Met Ala Lys Leu He Ala Leu Thr Leu Leu Gly Met Gly Leu 
15 10 


gaa etc ttc sgg aac cac cag tot tat tac caa aca cga ctt aat got 9 9 

Ala Leu Phe Arg Asn His Gin Ser Ser Tyr Gin Thr Arg Leu Asn Ala 
15 20 25 30 


etc cga gag gta caa ccc gta gaa ctt cct aac tgt aat tta gtt aaa 
Leu Arg Glu Val Gin Pro Val Glu Leu Pro Asn Cys Asn Leu Val Lys 


gga ate gaa act ggc tct gaa gac atg gag ata ctg cct aat gga ctg 
Gly He Glu Thr Gly Ser Glu Asp Met Glu He Leu Pro Asn Gly Leu 


get ttc att age tct gga tta aag tat cct gga ata aag age ttc aac 
Ala Phe lie Ser Ser Gly Leu Lys Tyr Pro Gly lie Lys Ser Phe Asn 


ccc aac agt cct gga aaa ata ctt ctg atg gac ctg aat gaa gaa gat 29 1 

Pro Asn Ser Pro Gly Lys lie Leu Leu Met Asp Leu Asn Glu Glu Asp 
80 85 90 


oca aca gtg ttg gaa ttg ggg ate act gga agt aaa ttt gat gta tct 33 9 

Pro Thr Val Leu Glu Leu Gly He Thr Gly Ser Lys Phe Asp Val Ser 
95 100 105 110 


tea ttt aac cct cat ggg att age aca ttc aca gat gaa gat aat gec 
Ser Phe Asn Pro His Gly lie Ser Thr Phe Thr Asp Glu Asp Asn Ala 
115 120 125 


atg tac etc ctg gtg gtg aac cat cca gat gee aag tec aca gtg gag 
Met Tyr Leu Leu Val Val Asn His Pro Asp Ala Lys Ser Thr Val Glu 
130 135 140 


ttg ttt aaa ttt caa gaa gaa gaa aaa teg ctt ttg cat eta aaa acc 
Leu Phe Lys Phe Gin Glu Glu Glu Lys Ser Leu Leu His Leu Lys Thr 
145 150 155 


ate aga cat aaa ctt ctg cct aat ttg aat gat att gtt get gtg gga 
lie Arg His Lys Leu Leu Pro Asn Leu Asn Asp lie Val Ala Val Gly 
160 165 170 


cct gag cac ttt tat ggc aca aat gat cac tat ttt ctt gac ccc tac 
Pro Glu His Phe Tyr Gly Thr Asn Asp His Tyr Phe Leu Asp Pro Tyr 
175 ISO 185 190 


tta caa tec tgg gag atg tat ttg ggt tta gcg tgg teg tat gtt gtc 


627 
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Leu Gin Ser Trp Glu Met Tyr Leu Gly Leu Ala Trp Ser Tyr Val Val 

195 200 205 

tac tat agt eca agt gaa gtt cga gtg gtg gca gaa gga ttt gat ttt 675 
Tyr Tyr Ser Pro Ser Glu Val Arg Val Val Ala Glu Gly Phe Asp Phe 
210 215 220 

get aat gga ate aac att tea ccc gat ggc aag tat gtc tat ata get 72 3 

Ala Asn Gly lie Asn lie Ser Pro Asp Gly Lys Tyr Val Tyr lie Ala 
225 230 235 

gag ttg ctg get cat aag att cat gtg tat gaa aag cat get aat tgg 77 1 

Glu Leu Leu Ala His Lye lie His Val Tyr Glu Lys His Ala Asn Trp 
240 245 250 

act tta act eca ttg aag tec ctt gac ttt aat acc etc gtg gat aac 819 
Thr Leu Thr Pro Leu Lys Ser Leu Asp Phe Asn Thr Leu Val Asp Asn 
255 260 265 270 

ata tct gtg gat cct gag aca gga gac ctt tgg gtt gga tgc eat ccc 86 7 

lie Ser Val Asp Pro Glu Thr Gly Asp Leu Trp Val Gly Cys His Pro 
275 280 235 

aat ggc atg aaa ate ttc ttc tat gac tea gag aat cct cct gca tea 915 
Asn Gly Met Lys lie Phe Phe Tyr Asp Ser Glu Asn Pro Pro Ala Ser 
290 295 300 

gag gtg ctt cga ate cag aac att eta aca gaa gaa cct aaa gtg aca 96 3 

Glu Val Leu Arg Tie Gin Asn Tie Leu Thr Glu Glu Pro Lys Val Thr 
305 310 315 

cag gtt tat gca gaa aat ggc sea gtg ttg caa ggc agt aca gtt gee 1011 
Gin Val Tyr Ala Glu Asn Gly Thr Val Leu Gin Gly Ser Thr Val Ala 
320 325 330 

tct gtg tac aaa ggg aaa ctg ctg att ggc aca gtg ttt cac aaa get 105 9 

Ser Val Tyr Lys Gly Lys Leu Leu lie Gly Thr Val Phe His Lys Ala 
335 340 345 350 

ctt tac tgt gag etc taa cagaccgatt tgcacccatg ccatagaaac 110 7 
Leu Tyr Cys Glu Leu * 
355 

tgaggecatt atttcaaccg ettgecatat tccgaggacc cagtgttctt agctgaacaa 1167 

tgaatgctga ccctaaatgt ggacatcatg aagcatcaaa gcactgttta actgggagtg 122 7 

atatgatgtg tagggctttt ttttgagaat acactatcaa atcagtcttg gaatacttga 128 7 

aaacctcatt taccataaaa atccttctca ctaaaatgga taaatcagtt aaaaaaaaa 134 6 

<210> SEQ ID HO 22 

<211> LENGTH: 355 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<400> SEQUENCE: 22 

Met Ala Lys Leu lie Ala Leu Thr Leu Leu Gly Met Gly Leu Ala Leu 

15 10 15 

Phe Arg Asn. His Gin Ser Ser Tyr Gin Thr Arg Leu Asn Ala Leu Arg 
20 25 30 

Glu Val Gin Pro Val Glu Leu Pro Asn Cys Asn Leu Val Lys Gly lie 
35 40 45 

Glu Thr Gly Ser Glu Asp Met Glu He Leu Pro Asn Gly Leu Ala Phe 
50 55 60 

He Ser Ser Gly Leu Lys Tyr Pro Gly He Lys Ser Phe Asn Pro Asn 


Ser Pro Gly Lys He Leu Leu Met Asp Leu Asn Glu Glu Asp Pro Thr 
85 90 95 
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Val Leu Glu Leu Gly lie Thr Gly Ser Lys Phe Asp Val Ser Ser Phe 
100 105 110 

Asn Pro His Gly lie Ser Thr Phe Thr Asp Glu Asp Asn Ala Met Tyr 
115 120 125 

Leu Leu Val Val Asn Hie Pro Asp Ala Lys Ser Thr Val Glu Leu Phe 
130 135 140 

Lys Phe Gin Glu Glu Glu Lys Ser Leu Leu His Leu Lys Thr lie Arg 
145 150 155 160 

His Lys Leu Leu Pro Asn Leu Asn Asp lie Val Ala Val Gly Pro Glu 

165 170 175 

His Phe Tyr Gly Thr Asn Asp His Tyr Phe Leu Asp Pro Tyr Leu Gin 
180 185 190 

Ser Trp Glu Met Tyr Leu Gly Leu Ala Trp Ser Tyr Val Val Tyr Tyr 
195 200 205 

Ser Pro Ser Glu Val Arg Val Val Ala Glu Gly Phe Asp Phe Ala Asn 
210 215 220 

Gly He Asn He Ser Pro Asp Gly Lys Tyr Val Tyr He Ala Glu Leu 
225 230 235 240 

Leu Ala His Lys Tie His Val Tyr Glu Lys His Ala Asn Trp Thr Leu 
245 250 255 

Thr Pro Leu Lys Ser Leu Asp Phe Asn Thr Leu Val Asp Asn He Ser 
260 265 270 

Val Asp Pro Glu Thr Gly Asp Leu Trp Val Gly Cys His Pro Asn Gly 
275 280 285 

Met Lys lie Phe Phe Tyr Asp Ser Glu Asn Pro Pro Ala Ser Glu Val 
290 295 300 

Leu Arg lie Gin Asn He Leu Thr Glu Glu Pro Lys Val Thr Gin Val 
305 310 315 320 

Tyr Ala Glu Asn Gly Thr Val Leu Gin Gly Ser Thr Val Ala Ser Val 

325 330 335 

Tyr Lys Gly Lys Leu Leu lie Gly Thr Val Phe His Lys Ala Leu Tyr 

340 345 350 

Cys Glu Leu 
355 


<210> SEQ ID NO 2 3 

<211> LENGTH : 15 7 0 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (1) (10 97) 

<223> OTHER INFORMATION : Nucleotide sequence encoding paraoxonase 
2 (PON 2 ) 

<400> SEQUENCE: 2 3 

egg age gag gca gcg cgc ccg get ccc gcg cca tgg ggc ggc tgg tgg 

Arg Ser Glu Ala Ala Arg Pro Ala Pro Ala Pro Trp Gly Gly Trp Trp 
15 10 15 

ctg tgg get tgc tgg gga teg cgc tgg cgc tec tgg gcg aga ggc ttc 

Leu Trp Ala Cys Trp Gly Ser Arg Trp Arg Ser Trp Ala Arg Gly Phe 


tgg caa tea gaa ate gac tta aag act cca gag aag tag aat etg tag 
Trp His ser Glu He Asp Leu Lys Pro Pro Glu Lys * Asn Leu * 
35 40 45 


144 
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acc ttc cac act gcc acc tga tta aag gaa ttg aag ctg get ctg aag 
Thr Phe His Thr Ala Thr * Leu Lys Glu Leu Lys Leu Ala Leu Lys 


ata ttg aca tac ttc cca atg gtc tgg ctt ttt tta gtg tgg gtc taa 
He Leu Thr Tyr Phe Pro Met Val Trp Leu Phe Leu Val Trp Val * 


aat tec cag gac tec aca get ttg cac cag ata age ctg gag gaa tac 
Asn Ser Gin Asp Ser Thr Ala Leu His Gin lie Ser Leu Glu Glu Tyr 


taa tga tgg ate taa aag aag aaa aac caa ggg cac ggg aat taa gaa 

* * Trp lie * Lye Lys Lys Asn Gin Gly His Gly Asn * Glu 

95 100 

tea gtc gtg ggt ttg att tgg cct cat tea ate cac atg gca tea gca 

Ser Val Val Gly Leu lie Trp Pro His Ser lie His Met Ala Ser Ala 

105 110 115 120 

ctt tea tag aca acg atg aca cag ttt ate tct ttg ttg taa acc acc 

Leu Ser * Thr Thr Met Thr Gin Phe lie Ser Leu Leu * Thr Thr 
125 130 

cag aat tea aga ata cag tgg aaa ttt tta aat ttg aag aag cag aaa 

Gin Asn Ser Arg He Gin Trp Lys Phe Leu Asn Leu Lys Lys Gin Lys 

135 140 145 150 

att etc tgt tgc ate tga aaa cag tea aac atg age ttc ttc caa gtg 

He Leu Cys Cys He * Lys Gin Ser Asn Met Ser Phe Phe Gin Val 

155 ISO 165 

tga atg aca tea cag ctg ttg gac egg cac att tct atg cca caa atg 

* Met Thr Ser Gin Leu Leu Asp Arg His lie Ser Met Pro Gin Met 

170 175 180 

acc act act tct ctg ate ctt tct taa agt att tag aaa cat act tga 

Thr Thr Thr Ser Leu He Leu Ser * Ser He * Lys His Thr * 
185 190 

act tac act ggg caa atg ttg ttt act aca gtc caa atg aag tta aag 

Thr Tyr Thr Gly Gin Met Leu Phe Thr Thr Val Gin Met Lys Leu Lys 

195 200 205 

tgg tag cag aag gat ttg att cag caa atg gga tea ata ttt cac ctg 

Trp * Gin Lys Asp Leu lie Gin Gin Met Gly Ser He Phe His Leu 

210 215 220 

atg ata agt ata tct atg ttg ctg aca tat tgg etc atg aaa ttc atg 

Met He Ser lie Ser Met Leu Leu Thr Tyr Trp Leu Met Lys Phe Met 

225 230 235 240 

ttt tgg aaa aac aca eta ata tga att taa etc agt tga agg tac ttg 

Phe Trp Lys Asn Thr Leu He * He * Leu Ser * Arg Tyr Leu 
245 250 

age tgg ata cac tgg tgg ata att tat eta ttg ate ctt cct egg ggg 

Ser Trp lie His Trp Trp lie He Tyr Leu Leu He Leu Pro Arg Gly 

255 260 2e5 

aca tct ggg tag get gtc ate eta atg gee aga age tct teg tgt atg 

Thr Ser Gly * Ala Val He Leu Met Ala Arg Ser Ser Ser Cys Met 

270 275 280 

acc cga aca ate etc cct cgt cag agg ttc tec gca tec aga aca ttc 

Thr Arg Thr He Leu Pro Arg Gin Arg Phe Ser Ala Ser Arg Thr Phe 

285 290 295 300 

tat ctg aga age eta cag tga eta cag ttt atg cca aca atg ggt ctg 

Tyr Leu Arg Ser Leu Gin * Leu Gin Phe Met Pro Thr Met Gly Leu 

305 310 315 

ttc tec aag gaa gtt ctg tag cct cag tgt atg atg gga age tgc tea 

Phe ser Lys Glu val Leu * Pro Gin cys Met net Gly ser cys ser 

320 325 330 
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tag gca ctt tat acc aca gag cct tgt att gtg aac tct aa attgtacttt 1107 
* Ala Leu Tyr Thr Thr Glu Pro Cys lie Val Asn Ser 
335 340 

tggcatgaaa gtgcgataac ttaacaatta attttctatg aattgctaat tctgagggaa 1167 

tttaaccagc aacattgacc cagaaatgta tggcatgtgt agttaatttt attccagtaa 122 7 

ggaacggccc ttttagttct tagagcactt ttaacaaaaa aggaaaatga acaggttctt 128 7 

taaaatgcca agcaagggac agaaaagaaa gctgctttcg aataaagtga atacattttg 134 7 

cacaaagtaa gcctcacctt tgccttccaa ctgccagaac atggattcca ctgaaataga 140 7 

gtgaattata tttccttaaa atgtgagtga cctcacttct ggcactgtga Gtactatggn 146 7 

tgtttagaac tactgataac gtattttgat gttttgtact tacatctttg tttaccatta 152 7 

aaaagttgga gttatattaa agactaacta aaatcccagt ttt 157 0 

<210> SEQ ID HO 2 4 

<211> LENGTH: 3 42 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<400> SEQUENCE : 2 4 

Arg Ser Glu Ala Ala Arg Pro Ala Pro Ala Pro Trp Gly Gly Trp Trp 
15 10 15 

Leu Trp Ala Cys Trp Gly Ser Arg Trp Arg Ser Trp Ala Arg Gly She 
20 25 30 

Trp His Ser Glu lie Asp Leu Lys Pro Pro Glu Lys Asn Leu Thr Phe 
35 40 45 

His Thr Ala Thr Leu Lys Glu Leu Lys Leu Ala Leu Lys lie Leu Thr 
50 55 60 

Tyr Phe Pro Met Val Trp Leu. Phe Leu Val Trp Val Asn Ser Gin Asp 
65 70 75 80 

Ser Thr Ala Leu His Gin lie Ser Leu Glu Glu Tyr Trp lie Lys Lys 
85 90 95 

Lys Asn Gin Gly His Gly Asn Glu Ser Val Val Gly Leu lie Trp Pro 
100 105 110 

His Ser lie His Met Ala Ser Ala Leu Ser Thr Thr Met Thr Gin Phe 
115 120 125 

lie Ser Leu Leu Thr Thr Gin Asn Ser Arg lie Gin Trp Lys Phe Leu 
130 135 140 

Asn Leu Lys Lys Gin Lys lie Leu Cys Cys lie Lys Gin Ser Asn Met 
145 150 155 160 

Ser Phe Phe Gin Val Met Thr Ser Gin Leu Leu Asp Arg His lie Ser 
165 170 175 

Met Pro Gin Met Thr Thr Thr Ser Leu lie Leu Ser Ser lie Lys His 
ISO 185 190 

Thr Thr Tyr Thr Gly Gin Met Leu Phe Thr Thr Val Gin Met Lys Leu 
195 200 205 

Lys Trp Gin Lys Asp Leu lie Gin Gin Met Gly Ser lie Phe Hie Leu 
210 215 220 

Met lie Ser lie Ser Met Leu Leu Thr Tyr Trp Leu Met Lys Phe Met 
225 230 235 240 

Phe Trp Lys Asn Thr Leu lie lie Leu ser Arg Tyr Leu ser Trp lie 
245 250 255 
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His Trp Trp lie lie Tyr Leu Leu lie Leu Pro Arg Gly Thr Ser Gly 
260 265 270 

Ala Val lie Leu Met Ala Arg Ser Ser Ser Cys Met Thr Arg Thr lie 
275 280 285 

Leu Pro Arg Gin Arg Phe Ser Ala Ser Arg Thr Phe Tyr Leu Arg Ser 
290 295 300 

Leu Gin Leu Gin Phe Met Pro Thr Met Gly Leu Phe Ser Lys Glu Val 
305 310 315 320 

Leu Pro Gin Cys Met Met Gly Ser Cys Ser Ala Leu Tyr Thr Thr Glu 
325 330 335 

Pro Cys lie Val Asn Ser 
340 


<210> SEQ ID NO 25 

<211> LENGTH: 533 

<212> TYPE: DNA 

<213> ORGANISM: Homo eapien 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<2 22> LOCATION: (47). ..(346) 

<223> OTHER INFORMATION: Nucleotide sequence encoding apolipoprotein 
C-TII(APOC3 ) 


<4 0 0> SEQUENCE: 2 5 


tgctcagttc atccctagag gcagctgctc caggaacaga ggtgcc atg cag ccc 55 

Met Gin Pro 


egg gta etc ctt gtt gtt gee etc ctg gcg etc ctg gee tct gec cga 
Arg Val Leu Leu Val Val Ala Leu Leu Ala Leu Leu Ala Ser Ala Arg 


get tea gag gee gag gat gec tec ctt etc age ttc atg cag ggt tac 151 
Ala Ser Glu. Ala Glu Asp Ala Ser Leu Leu Ser Phe Met Gin Gly Tyr 
20 25 30 35 


atg aag cac gec ace aag acc gec aag gat gca ctg age age gtg cag 
Met Lys His Ala Thr Lye Thr Ala Lys Asp Ala Leu Ser Ser Val Gin 


gag tec cag gtg gec cag cag gec agg ggc tgg gtg acc gat ggc ttc 24 7 
Glu Ser Gin Val Ala Gin Gin Ala Arg Gly Trp Val Thr Asp Gly Phe 
55 60 65 

agt tec ctg aaa gac tac tgg age acc gtt aag gac aag ttc tct gag 295 
Ser Ser Leu Lys Asp Tyr Trp Ser Thr Val Lys Asp Lys Phe Ser Glu 
70 75 80 

ttc tgg gat ttg gac cct gag gtc aga cca act tea gee gtg get gee 34 3 
Phe Trp Asp Leu Asp Pro Glu Val Arg Pro Thr Ser Ala Val Ala Ala 

85 90 95 

tga gacctcaata ccccaagtcc acctgcctat ccatcctgcg agctccttgg 39 6 

gtcctgcaat ctccagggct gcccctgtag gttgcttaaa agggacagta ttctcagtgc 45 6 

tctcGtaccc cacctcatge ctggcccccc tccaggcatg ctggcctccc aataaagctg 516 

gacaagaagc tgctatg 53 3 


<210> SEQ ID NO 2 6 

<211> LENGTH: 9 9 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 


<4 0 0> SEQUENCE: 2 6 
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Met Gin Pro Arg Val Leu Leu Val Val Ala Leu Leu Ala Leu Leu Ala 

15 10 15 

Eer Ala Arg Ala Ser Glu Ala Glu Asp Ala Ser Leu Leu Ser Phe Met 
20 25 30 

Gin Gly Tyr Met Lys Hie Ala Thr Lys Thr Ala Lys Asp Ala Leu Ser 
35 40 45 

Ser Val Gin Glu Ser Gin Val Ala Gin Gin Ala Arg Gly Trp Val Thr 
50 55 60 

Asp Gly Phe Ser Ser Leu Lys Asp Tyr Trp Ser Thr Val Lys Asp Lys 
65 70 75 80 

Phe Ser Glu Phe Trp Asp Leu Asp Pro Glu Val Arg Pro Thr Ser Ala 


Val Ala Ala 


<210> SEQ ID NO 2 7 

<211> LENGTH : 8 92 5 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<2 21> NAME /KEY: m±sc_f eature 

<222> LOCATION: 5081, 5032, 5083, 5084, 5085, 5086, 5087, 5088, 5089, 

5090, 5091, 5092, 5093, 5094, 5095, 5096, 5097, 5098, 5099, 5100, 
5101, 5102, 5103, 5104, 5105, 5106, 5107, 5108, 5109, 5110, 
5111, 5112, 5113, 5114, 5115, 5116, 5117, 5118, 5119 

<223> OTHER INFORMATION: n = A,T,C or G 

<2 2 0> FEATURE: 

<2 21> NAME /KEY: misc_f eature 

<222> LOCATION: 5120, 5121, 5122, 5123, 5124, 5125, 5126, 5127, 5128, 

5129, 5130, 5131, 5132, 5133, 5134, 5135, 5136, 5137, 5138, 5139, 
5140, 5141, 5142, 5143, 5144, 5145, 5146, 5147, 514B, 5149, 
5150, 5151, 5152, 5153, 5154, 5155, 5156, 5157, 5158 

<223> OTHER INFORMATION: n = A,T,C or G 

<2 2 0> FEATURE: 

<221> NAME /KEY: mi sc_f eature 

<222> LOCATION: 5159, 5160, 5161, 5162, 5163, 5164, 5165, 5166, 5167, 

5168, 5169, 5170, 5171, 5172, 5173, 5174, 5175, 5176, 5177, 5178, 
5179 

<223> OTHER INFORMATION: n = A,T,C or G 

<2 2 0> FEATURE: 

<221> NAME/KEY: CDS 

<2 22> LOCATION: ( 5 0 2 0 ) . . . ( 6 1 6 2 ) 

<223> OTHER INFORMATION: Nucleotide encoding ATP-binding cassette (ABC1 ) 
<4 0 0> SEQUENCE: 2 7 


ctcagtgtca 

gctgctgctg 

gaagtggcct 

ggcctctatt 

tatcttcctg 

atcctgatct 

60 

ctgttcggct 

gagctaccca 

ccctatgaac 

aacatgaatg 

ccattttcca 

aataaagcca 

120 

tgccctctgc 

aggaacac tt 

ccttgggttc 

aggggattat 

ctgtaatgcc 

aacaacccct 

180 

gtttccgtta 

cccgactcct 

ggggaggctc 

ccggagttgt 

tggaaacttt 

aacaaatcca 

240 

ttgtggctcg 

cctgttctca 

gatgctcgga 

ggcttctttt 

atacagccag 

aaagacacca 

300 

gcatgaagga 

catgcgcaaa 

gttctgagaa 

cattacagca 

gatcaagaaa 

tccagctcaa 

360 

acttgaagct 

tcaagatttc 

ctggtggaca 

atgaaacctt 

ctctgggttc 

ctgtatcaca 

420 

acctctctct 

cccaaagtct 

actgtg^aca 

agatgctgag 

ggctgatgtc 

attctccaca 

480 

aggtattttt 

gcaaggctac 

cagttacatt 

tgacaagtct 

gtgcaatgga 

tcaaaatcag 

540 

aagagatgat 

tcaacttggt 

gaccaagaag 

tttctgagct 

ttgtggccta 

ccaagggaga 

600 

aactggctgc 

agcagagcga 

gtacttcgtt 

ccaacatgga 

catcctgaag 

ccaatcctga 

660 

gaacactaaa 

ctctacatct 

cccttcccga 

gcaaggagct 

ggctgaagcc 

acaaaaacat 

720 
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tgctgcatag 

tcttgggact 

ctggcccagg 

agctgttcag 

catgagaagc 

tggagtgaca 

780 

tgcgacagga 

ggtgatgttt 

otgaccaatg 

tgaacagctc 

cagctcctcc 

acccaaatct 

840 

accaggctgt 

gtctcgtat-t 

gtctgcgggc 

atcccgaggg 

aggggggctg 

aagatcaagt 

900 

ctctcaaatg 

gtatgaggac 

aacaactaca 

aagccctott 

tggaggcaat 

ggcactgagg 

960 

aagatgctga 

aaccttctat 

gacaactcta 

caactcctta 

ctgcaatgat 

ttgatgaaga 

1020 

atttggag-tc 

tagtcctctt 

tcccgcatta 

tctggaaagc 

tctgaagccg 

ctgctcgttg 

1080 

ggaagatcct 

gtatacacct 

gacactccag 

ccacaaggca 

ggtcatggct 

gaggtgaaca 

1140 

agaccttcca 

ggaactggc-t 

gtgttccatg 

atctggaagg 

catg-bgggag 

gaactcagcc 

1200 

ccaagatctg 

gaccttcatg 

gagaacagcc 

aagaaatgga 

ccttgtccgg 

atgctgttgg 

1260 

acagcaggga 

caatgaccac 

ttttgggaac 

agcagttgga 

tggct-tagat 

tggacagccc 

1320 

aagacatcgt 

ggcgtttttg 

gccaagcacc 

cagaggatgt 

ccagtccagt 

aatggttctg 

1380 

tgtacacctg 

gagagaagct 

ttcaacgaga 

ctaaccaggc 

aatccggacc 

atatctcgct 

1440 

tcatggagtg 

tgtcaacctg 

aacaagctag 

aacccatagc 

aacagaagtc 

tggctcatca 

1500 

acaagtccat 

ggagctgctg 

gatgagagga 

agttctgggc 

tggtattgtg 

ttcactggaa 

1560 

ttactccagg 

□agcattgag 

ctgccccatc 

atgtcaagta 

caagatccga 

atggacattg 

1620 

acaatgtgga 

gaggacaaat 

aaaatcaagg 

atgggtactg 

ggaccctggt 

cctcgagctg 

1680 

acccctttga 

ggacatgcgg 

-tacgtctggg 

ggggcttcgc 

ctacttgcag 

gatgtggtgg 

1740 

agcaggcaat 

catcagggtg 

ctgacgggca 

ccgagaagaa 

aactggtgtc 

tatatgcaac 

1800 

agatgcccta 

tccctgttac 

gttgatgaca 

-tctttc-tgcg 

gg-tgatgagc 

cggtcaatgc 

1860 

ccctcttcat 

gacgctggcc 

tggatttact 

cagtggctgt 

gatcatcaag 

ggcatcgtgt 

1920 

atgagaagga 

ggcacggctg 

aaagagacca 

tgcggatcat 

gggcctggac 

aacagcatcc 

1980 

tctggtttag 

ctggttcatt 

agtagcctca 

ttcctcttct 

tgtgagcgct 

ggcctgctag 

2040 

tggtcatcct 

gaagttagga 

aacctgctgc 

cctacagtga 

tcccagcgtg 

gtgtttgtct 

2 100 

tcctgtccgt 

gt-ttgctgtg 

gtgacaatcc 

-tgcagtgc-tt 

cctgattagc 

acactcttct 

2160 

ccagagccaa 

cctggcagca 

gcctgtgggg 

gcatcatcta 

cttcacgctg 

tacctgccct 

2220 

acgtcctgtg 

tgtggcatgg 

caggactacg 

tgggcttcac 

actcaagatc 

ttcgctagcc 

2280 

tgctgtctcc 

tgtggctttt 

gggtttggct 

gtgagtactt 

tgcccttttt 

gaggagcagg 

2340 

gcattggagt 

gcagtgggac 

aacctgtttg 

agagtcctgt 

ggaggaagat 

ggcttcaatc 

2400 

tcaccacttc 

gg-tc-tccatg 

atgctgtttg 

acaccttcct 

ctatggggtg 

atgacctggt 

2460 

acattgaggc 

tgtctttcca 

ggccagtacg 

gaattcccag 

gccctggtat 

tttccttgca 

2520 

ccaagtccta 

ctggtttggc 

gaggaaagtg 

atgagaagag 

ccaccctggt 

tccaaccaga 

2580 

agagaatatc 

agaaatctgc 

atggaggagg 

aacccaccca 

cttgaagctg 

ggcgtgtcca 

2640 

ttcagaacct 

ggtaaaagtc 

taccgagatg 

ggatgaaggt 

ggctgtcgat 

ggcctggcac 

2700 

tgaattttta 

tgagggccag 

atcacctcct 

tcctgggcca 

caatggagcg 

gggaagacga 

2760 

ccaccatgtc 

aatcctgacc 

gggttgttcc 

ccccgacctc 

gggcaccgcc 

tacatcctgg 

2820 

gaaaagacat 

tngctctgag 

atgagcacca 

tccggcagaa 

cctgggggtc 

tgtccccagc 

2880 

ataacg-tgct 

gtttgacatg 

ctgactgtcg 

aagaacacat 

ctgg-btctat 

gcccgcttga 

2940 

aagggctc-tc 

tgagaagcac 

gtgaaggcgg 

agatggagca 

gatggccctg 

gatg-t-tggtt 

3000 
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tgccatcaag 

caagctgaaa 

agcaaaacaa 

gccagctgtc 

aggtggaatg 

cagagaaagc 

3060 

tatctgtggc 

cttggccttt 

gtcgggggat 

ctaaggttgt 

cattctggat 

gaacccacag 

3120 

ctggtgtgga 

cccttactcc 

cgcaggggaa 

tatgggagct 

getgetgaaa 

taccgacaag 

3 180 

gccgcaccat 

tattctctct 

acacaccaca 

tggatgaagc 

ggacgtcctg 

ggggacsgga 

3240 

ttgccatcat 

ctcccatggg 

aagctgtgct 

gtgtgggctc 

ctccctgttt 

ctgaagaacc 

3300 

agctgggaac 

aggctactac 

ctgaccttgg 

tcaagaaaga 

tgtggaatcc 

tccctcagtt 

3360 

cctgcagaaa 

□agtagtagc 

actgtgtcat 


ggaggacagt 

gtttctcaga 

3420 

gcagttctga 

tgctggcctg 

ggcagcgacc 

atgagag-tga 

cacgctgacc 

atcgatgtct 

3480 

ctgctatctc 

caacctcatc 

aggaagcatg 

tgtctgaagc 

ccggctggtg 

gaagacatag 

3540 

ggcatgagct 

gacctatgtg 

ctgccatatg 

aagctgctaa 

ggagggagee 

tttgtggaac 

3600 

tctttcatga 

gattgatgac 

cggctctcag 

acctgggcat 

ttctagttat 

ggcatctcag 

3660 

agacgaccct 

ggaagaaata 

ttcctcaagg 

tggccgaaga 

gagtggggtg 

gatgetgaga 

3720 

cctcagatgg 

taccttgcca 

gcaagacgaa 

acaggcgggc 

etteggggae 

aagcagagct 

3780 

gtcttcgccc 

gttcactgaa 

gatgatgctg 

ctgatccaaa 

tgattctgac 

atagacccag 

3840 

aatccagaga 

gacagacttg 

ctcagtggga 

tggatggcaa 

agggtcntac 

caggtgaaag 

3900 

gctggaaact 

tacacagcaa 

cagtttgtgg 

cccttttgtg 

gaagagactg 

etaattgeca 

3960 

gacggagtcg 

gaaaggattt 

tttgctcaga 

ttgtcttgcc 

agctgtgttt 

gtctgcattg 

4020 

cccttgtgtt 

cagcctgatc 

gtgccaccct 

ttggcaagta 

ccccagcctg 

gaacttcagc 

4080 

cctggatgta 

caacgaacag 

tacacatttg 

tcagcaatga 

tgctcc-tgag 

gaeaegggaa 

4 140 

ccctggaact 

cttaaacgcc 

ctcaccaaag 

accctggctt 

cgggacccgc 

tgtatggaag 

4200 

gaaacccaat 

cccagacacg 

ccctgccagg 

caggggagga 

agagtggacc 

actgccccag 

4260 

ttccccagac 

catcatggac 

ctcttccaga 

atgggaactg 

gaeaatgeag 

aacccttcac 

4320 

ctgcatgcca 

gtgtagcagc 

gacaaaatca 

agaagatgct 

gcctgtgtgt 

cccccagggg 

4380 

caggggggct 

gcctcctcca 

caaagaaaac 

aaaacactgc 

agatatcctt 

caggacctga 

4440 

caggaagaaa 

catttcggat 

t at c t ggt ga 

agacgtatgt 

gcagatcata 

gecaaaaget 

4500 

taaagaacaa 

gatctgggtg 

aatgagttta 

ggtatggcgg 

cttttccctg 

ggtgtcagta 

4560 

atactcaagc 

acttcctccg 

agtcaagaag 

ttaatgatgc 

catcaaacaa 

atgaagaaac 

4620 

acctaaagct 

ggccaaggac 

agttctgcag 

atcgatttct 

caacagcttg 

ggaagattta 

4680 

tgacaggact 

ggacaccaaa 

aataatgtca 

aggtgtgg-tt 

caataacaag 

ggctggcatg 

4740 

caatcagctc 

tttcctgaat 

gtcatcaaca 

atgccattct 

ccgggccaac 

ctgeaaaagg 

4800 

gagagaaccc 

tagccattat 

ggaattactg 

ctttcaatca 

tcccctgaat 

ctcaccaagc 

4860 

agcagctctc 

agaggtggct 

cggatgacca 

catcagtgga 

tgtccttgtg 

tccatctgtg 

4920 

tcatctttgc 

aat gtccttc 

gtcccagcca 

gctttgtcgt 

attcctgatc 

caggageggg 

4980 

tcagcaaagc 

aaaacacctg 

cagttcatca 

gtggagtga age ctg tea 
Ser Leu Eer 

1 

tct act 
Ser Thr 
5 

5034 


ggc tct eta att ttg tct ggg at a tgt gca att sag ttg ttt cca ann 5 082 

Gly Ser Leu. lie Leu Ser Gly lie Cys Ala lie Lys Leu Phe Pro Xaa 
10 15 20 


nnn nnn nnn. nnn rmn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nun 5 13 0 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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nta. ate ttt cct ttt cag tgc ttt ggg etc ctg gga gtt aat ggg get 
Xaa lie Phe Pro Phe Gin Cys Phe Gly Leu Leu Gly Val Asn Gly Ala 


gga aaa tea tea act ttc aag atg tta aca gga gat acc act gtt acc 
Gly Lys Ser Ser Thr Phe Lys Met Leu Thr Gly Asp Thr Thr Val Thr 


aga gga gat get ttc ctt aac att tgc agt ate tta tea aac ate cat 

Arg Gly Asp Ala Phe Leu Asn. lie Cys Ser He Leu Ser Asn He His 

90 95 100 

gaa gta cat cag aac atg ggc tac tgc cct cag ttt gat gec ate aca 

Glu Val His Gin Asn Met Gly Tyr Cys Pro Gin Phe Asp Ala lie Thr 

105 110 115 

gag ctg ttg act ggg aga gaa cac gtg gag ttc ttt gec ctt ttg aga 

Glu Leu Leu Thr Gly Arg Glu His Val Glu Phe Phe Ala Leu Leu Arg 

120 125 130 

gga gtc cca gag aaa gaa gtt ggc aag gtt ggt gag tgg gcg att egg 

Gly Val Pro Glu Lys Glu Val Gly Lys Val Gly Glu Trp Ala lie Arg 

135 140 145 

aaa ctg ggc etc gtg aag tat gga gaa aaa tat get ggt aac tat agt 

Lys Leu Gly Leu Val Lys Tyr Gly Glu Lys Tyr Ala Gly Asn Tyr Ser 

150 155 ISO 165 

gga ggc aac aaa cgc aag etc tct aca gee atg get ttg ate ggc ggg 

Gly Gly Asn Lys Arg Lye Leu Ser Thr Ala Met Ala Leu He Gly Gly 

170 175 ISO 

cct cct gtg gtg ttt ctg gat gaa ccc acc aca ggc atg gat ccc aaa 

Pro Pro Val Val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Lys 

185 190 195 

gee egg egg ttc ttg tgg aat tgt gee eta agt gtt gtc aag gag ggg 

Ala Arg Arg Phe Leu Trp Asn Cys Ala Leu Ser Val Val Lys Glu Gly 

200 205 210 

aga tea gta gtg ctt aca tct cat agt atg gaa gaa tgt gaa get ctt 

Arg Ser Val Val Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu 

215 220 225 

tgc act agg atg gca ate atg gtc aat gga agg ttc agg tgc ctt ggc 

Cys Thr Arg Met Ala lie Met Val Asn Gly Arg Phe Arg Cys Leu Gly 

230 235 240 245 

agt gtc cag cat eta aaa aat agg ttt gga gat ggt tat aca ata gtt 

Ser Val Gin His Leu Lye Asn Arg Phe Gly Asp Gly Tyr Thr He Val 

250 255 260 

gta cga ata gca ggg tec aac ccg gac ctg aag cct gtc cag gat ttc 

Val Arg lie Ala Gly Ser Asn Pro Asp Leu Lys Pro Val Gin Asp Phe 

265 270 275 

ttt gga ctt gca ttt cct gga agt gtt eta aaa gag aaa cac egg aac 

Phe Gly Leu Ala Phe Pro Gly Ser Val Leu Lys Glu Lys His Arg Asn 

280 285 290 

atg eta caa tac cag ctt cca tct tea tta tct tct ctg gec agg ata 

Met Leu Gin Tyr Gin Leu Pro Ser Ser Leu Ser Ser Leu Ala Arg He 

295 300 305 

ttc age ate etc tec cag age aaa aag cga etc cac ata gaa gac tac 

Phe Ser He Leu Ser Gin Ser Lys Lys Arg Leu His He Glu Asp Tyr 

310 315 320 325 

tct gtt tct cag aca aca ctt gac caa gta ttt gtg aac ttt gee aag 

Ser Val Ser Gin Thr Thr Leu Asp Gin Val Phe Val Asn Phe Ala Lys 
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330 335 340 

gac caa agt gat gat gac cac tta aaa gac etc tea tta cac aaa aac 6 09 0 

Asp Gin Ser Asp Asp Asp His Leu Lys Asp Leu Ser Leu His Lye Asn 
345 350 355 

cag aca gta gtg gac gtt gca gtt etc ana tct ttt eta cag gat gag 6 13 8 

Gin Thr Val Val Asp Val Ala Val Leu Thr Ser Phe Leu Gin Asp Glu 
360 365 370 

aaa gtg aaa gaa age tat gta tga agaatcctgt teataegggg tggctgaaag 6 192 
Lys Val Lys Glu Ser Tyr Val * 

375 380 


taaagaggaa 

ctagactttc 

ctttgcacca 

tgtgaagtgt 

tgtggagaaa 

agagecagaa 

6252 

gttgatgtgg 

gaagaagtaa 

actggatact 

gtactgatac 

tattcaatgc 

aatgeaatte 

6312 

aatgcaatga 

aaacaaaatt 

ccattacagg 

ggcagtgcct 

ttgtagccta 

tgtcttgtat 

6372 

ggctctcaag 

tgaaagactt 

gaatttagtt 

ttttacctat 

acctatgtga 

aactctatta 

6432 


tggacatatg 

ggtttgaact 

cacacttttt 

tttttttttt 

tgttcctgtg 

6492 

tattctcatt 

ggggttgcaa 

caataattca 

tcaagtaatc 

atggccagcg 

attattgatc 

6552 

aaaatcaaaa 

ggtaatgeae 

atcctcattc 

actaagecat 

gccatgccca 

ggagactggt 

6612 

ttcccggtga 

cacatccatt 

gctggcaatg 

agtgtgccag 

agttattagt 

gccaagtttt 

6672 

tcagaaagtt 

tgaagcacca 

tggtgtgtca 

tgctcacttt 

tgtgaaagct 

gctctgctca 

6732 

gagtctatca 

acattgaata 

tcagttgaca 

gaatggtgcc 

atgcgtggct 

aacatcctgc 

6792 

tttgattccc 

tctgataagc 

tgttctggtg 

gcagtaacat 

gcaacaaaaa 

tgtgggtgtc 

6852 

tccaggcacg 

ggaaacttgg 

ttccattgtt 

atattgtcct 

atgettcgag 

ccatgggtct 

6912 

acagggtcat 

c cttatgaga 

c tcttaaata 

tac tt agate 

c tggtaagag 

gcaaagaatc 

6972 

aacagecaaa 

ctgctggggc 

tgcaagctgc 

tgaagecagg 

gcatgggatt 

aaagagattg 

7032 

tgcgttcaaa 

cctagggaag 

cctgtgccca 

tttgtcctga 

ctgtctgcta 

acatggtaca 

7092 

ctgcatctca 

agatgtttat 

c t g ac ac a ag 

tgtattattt 

c tggc ttttt 

gaattaatct 


agaaaatgaa 

aagatggagt 

tgtattttga 

caaaaatgtt 

tgtacttttt 

aatgttattt 

7212 

ggaattttaa 

gttctatcag 

tgacttctga 

atccttagaa 

tggcctcttt 

gtagaaccct 

7272 

gtggtataga 

ggagtatggc 

cactgcccca 

ctatttttat 

tttcttatgt 

aagtttgcat 

7332 

atcagtcatg 

actagtgect 

agaaagcaat 

gtgatggtca 

ggatctcatg 

acattatatt 

7392 

tgagtttctt 

tcagatcatt 

taggatactc 

ttaatctcac 

ttcatcaatc 

aaatattttt 

7452 

tgagtgtatg 

ctgtagctga 

aagagtat gt 

aegtaegtat 

aagactagag 

agatattaag 

7512 

tctcagtaca 

cttcctgtgc 

catgttattc 

agctcactgg 

tttacaaata 

taggttgtct 

7572 

tgtggttgta 

ggagcccact 

gtaacaatac 

tgggcagect 

tttttttttt 

ttttttaatt 

7632 

geaacaatge 

aaaagccaag 

aaagtataag 

ggtcacaagt 

ctaaacaatg 

aattcttcaa 

7692 

cagggaaaac 

agctagcttg 

aaaacttget 

gaaaaacaca 

acttgtgttt 

atggcattta 

7752 

gtaccttcaa 

ataattggct 

ttgeagat at 

tggatacocc 

attaaatctg 

acagtctcaa 

7812 

atttttcatc 

tcttcaatca 

ctagtcaaga 

aaaatataaa 

aacaacaaat 

acttccatat 

7872 

ggagcatttt 

tnagagtttt 

ctaacccagt 

cttatttttc 

tagtcagtaa 

acatttgtaa 

7932 

aaatactgtt 

tcactaatac 

ttactgttaa 

ctgtcttgag 

agaaaagaaa 

aatatgagag 

7992 

aactattgtt 

tggggaagtt 

caagtgatct 

ttcaatatca 

ttactaactt 

cttccacttt 

8052 
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ttccagaatt tgaatattaa cgctaaaggt gtaagacttc agatttcaaa ttaatctttc 8 112 

tatatttttt aaatttacag aatattatat aacccaotgc tgaaaaagaa aaaaatgatt 8 172 

gttttagaag ttaaagtcaa tattgatttt aaatataagt aatgaaggca tatttccaat 8232 

aactagtgat atggcatcgt tgcattttac agtatcttca aaaatacaga atttatagaa 8292 

taatttctcc tcatttaata tttttcaaaa tcaaagttat ggtttcctca ttttactaaa 8 352 

atcgtattct aattcttcat tatagtaaat ctatgagcaa ctccttactt cggttcctct 8 412 

gatttcaagg ccatatttta aaaaatcaaa aggcactgtg aactattttg aagaaaacac 8 472 

aacattttaa tacagattga aaggacctct tctgaagcta gaaacaatct atagttatac 8532 

atcttcatta at.ac-tgtgt.-t accttttaaa atagtaattt tttacatttt cctgtgtaaa 8592 

cctaattgtg gtagaaattt ttaccaactc tatactcaat caagcaaaat ttctgtatat 8 652 

tccctgtgga atgtacctat gtgagtttca gaaattctca aaatacgtgt tcaaaaattt 8 712 

ctgcttttgc atctttggga cacctcagaa aacttattaa caactgtgaa tatgagaaat 8 772 

acagaagaaa ataataagcc ctctatacat aaatgcccag caoaattcat tgttaaaaaa 8 832 

caaccaaacc tcacactact gtatttcatt atctgtactg aaagcaaatg ctttgtgact 8 892 

attaaatgtt gcacatcatt cattcactgt ats 8 925 

<210> SEQ ID HO 2 8 

<211> LEMGTH: 380 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY: VARIANT 

<222> LOCATION: 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 

34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 
50, 51, 52, 53, 54 

<223> OTHER IK FORMAT I OK : Xaa = Any Amino Acid 

<4 0 0> SEQUENCE: 2 8 

Ser Leu Ser Ser Thr Gly Ser Leu lie Leu Ser Gly lie Cys Ala lie 
15 10 15 

Lys Leu Phe Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa lie Phe Pro Phe Gin Cys Phe Gly Leu Leu 
50 55 60 

Gly Val Asn Gly Ala Gly Lys Ser Ser Thr Phe Lys Met Leu Thr Gly 
65 70 75 80 

Asp Thr Thr Val Thr Arg Gly Asp Ala Phe Leu Asn lie Cys Ser lie 
85 90 95 

Leu Ser Asn lie His Glu Val His Gin Asn Met Gly Tyr Cys Pro Gin 
100 105 110 

Phe Asp Ala lie Thr Glu Leu Leu Thr Gly Arg Glu His Val Glu Phe 
115 120 125 

Phe Ala Leu Leu Arg Gly Val Pro Glu Lys Glu Val Gly Lys Val Gly 
130 135 140 

Glu Trp Ala lie Arg Lye Leu Gly Leu Val Lys Tyr Gly Glu Lye Tyr 
145 150 155 160 

Ala Gly Asn Tyr ser Gly Gly Asn Lys Arg Lys Leu ser Thr Ala Met 

165 170 175 
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Ala Leu lie Gly Gly Pro Pro Val Val Phe Leu Asp Glu Pro Thr Thr 
ISO 185 190 

Gly Met Asp Pro Lys Ala Arg Arg Phe Leu Trp Asn Cys Ala Leu Ser 
195 200 205 

Val Val Lys Glu Gly Arg Ser Val Val Leu Thr Ser His Ser Met Glu 
210 215 220 

Glu Cys Glu Ala Leu Cys Thr Arg Met Ala lie Met Val Asn Gly Arg 
225 230 235 240 

Phe Arg Cys Leu Gly Ser Val Gin His Leu Lys Asn Arg Phe Gly Asp 
245 250 255 

Gly Tyr Thr lie Val Val Arg lie Ala Gly Ser Asn Pro Asp Leu Lys 
260 265 270 

Pro Val Gin Asp Phe Phe Gly Leu Ala Phe Pro Gly Ser Val Leu Lys 
275 280 285 

Glu Lys His Arg Asn Met Leu Gin Tyr Gin Leu Pro Ser Ser Leu Ser 
290 295 300 

Ser Leu Ala Arg He Phe Ser He Leu Ser Gin Ser Lys Lys Arg Leu 
305 310 315 320 

His Tie Glu Asp Tyr Ser Val Ser Gin Thr Thr Leu Asp Gin Val Phe 
325 330 335 

Val Asn Phe Ala Lys Asp Gin 

340 

Ser Leu His Lys Asn Gin Thr Val Val Asp Val Ala Val Leu Thr Ser 
355 360 365 

Phe Leu Gin Asp Glu Lys Val Lys Glu Ser Tyr Val 


<210> SEQ ID NO 2 9 

<211> LENGTH: 897 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (39) (842 ) 

<223> OTHER INFORMATION: Nucleotide sequence encoding apol ipoprotsin 
A-l (APOA1) 

<4 0 0> SEQUENCE: 2 9 

agagactgcg agaaggaggt cccccacggc ccttcagg atg aaa get gcg gtg ctg 

Met Lys Ala Ala Val Leu 
1 5 

acc ttg gec gtg etc ttc ctg acg ggg age cag get egg cat ttc tgg 
Thr Leu Ala Val Leu Phe Leu Thr Gly Ser Gin Ala Arg His Phe Trp 
10 15 20 

cag caa gat gaa ccc ccc cag age ccc tgg gat cga gtg aag gac ctg 
Gin Gin Asp Glu Pro Pro Gin Ser Pro Trp Asp Arg Val Lys Asp Leu 
25 30 35 

gec act gtg tac gtg gat gtg etc aaa gac age ggc aga gac tat gtg 
Ala Thr Val Tyr Val Asp Val Leu Lys Asp Ser Gly Arg Asp Tyr Val 
40 45 50 

tec cag ttt gaa ggc tec gee ttg gga aaa cag eta aac eta aag etc 

Ser Gin Phe Glu Gly Ser Ala Leu Gly Lys Gin Leu Asn Leu Lys Leu 


ctt gac aac tgg gac age gtg acc tec acc ttc age aag ctg cgc gaa 
Leu Asp Asn Trp Asp ser Val Thr ser Thr Phe ser Lys Leu Arg Glu 

75 80 85 


296 
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-continued 


cag etc ggc cct gtg acc cag gag ttc tgg gat aac ctg gaa aag gag 

Gin Leu Gly Pro Val Thr Gin. Glu Phe Trp Asp Asn Leu Glu Lys Glu 
90 95 100 

aca gag ggc ctg agg cag gag atg age aag gat ctg gag gag gtg aag 

Thr Glu Gly Leu Arg Gin Glu Met Ser Lys Asp Leu Glu Glu Val Lys 
105 110 115 

gec aag gtg cag ccc tac ctg gac gac ttc cag aag aag tgg cag gag 

Ala Lys Val Gin Pro Tyr Leu Asp Asp Phe Gin Lys Lys Trp Gin Glu 

120 125 130 

gag atg gag etc tac cgc cag aag gtg gag ccg ctg cgc gca gag etc 

Glu Met Glu Leu Tyr Arg Gin Lys Val Glu Pro Leu Arg Ala Glu Leu 

135 140 145 150 

caa gag ggc gcg cgc cag aag ctg cac gag ctg caa gag aag ctg age 

Gin Glu Gly Ala Arg Gin Lys Leu His Glu Leu Gin Glu Lys Leu Ser 
155 160 165 

cca ctg ggc gag gag atg cgc gac cgc gcg cgc gec cat gtg gac gcg 

Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala 
170 175 180 

ctg cgc acg cat ctg gec ccc tac age gac gag ctg cgc cag cgc ttg 

Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gin Arg Leu 
185 190 195 

gec gcg cgc ctt gag get etc aag gag aac ggc ggc gec aga ctg gee 

Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala 

200 205 210 

gag tac cac gec aag gec acc gag cat ctg age acg etc age gag aag 

Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys 

215 220 225 230 

gee aag ccc gcg etc gag gac etc cgc caa ggc ctg ctg ccc gtg ctg 

Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin Gly Leu Leu Pro Val Leu 
235 240 245 

gag age ttc aag gtc age ttc ctg age get etc gag gag tac act aag 

Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys 
250 255 260 

aag etc aac acc cag tga ggcgcccgcc gccgcccccc ttcccggtgc 

Lys Leu Asn Thr Gin * 
265 

tcagaataaa cgtttccaaa gtggg 


<210> SEQ ID HO 3 0 

<211> LENGTH: 267 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<400> SEQUENCE: 3 0 

Met Lys Ala Ala Val Leu Thr Leu Ala Val Leu Phe Leu Thr Gly Ser 

15 10 15 

Gin Ala Arg His Phe Trp Gin Gin Asp Glu Pro Pro Gin Ser Pro Trp 
20 25 30 

Asp Arg Val Lys Asp Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp 
35 40 45 

Ser Gly Arg Asp Tyr Val Ser Gin Phe Glu Gly Ser Ala Leu Gly Lys 
50 55 60 

Gin Leu Asn Leu Lys Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Thr 


Phe ser Lys Leu Arg Glu Gin Leu Gly Pro val Thr Gin Glu Phe Trp 
85 90 95 
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-continued 


Asp Asn Leu. Glu Lys Glu Thr Glu Gly Leu Arg Gin Glu Met Ser Lys 
100 105 110 

Asp Leu Glu Glu Val Lys Ala. Lys Val Gin Pro Tyr Leu Asp Asp Phe 
115 120 125 

Gin Lys Lys Trp Gin Glu Glu Met Glu Leu Tyr Arg Gin Lys Val Glu 
130 135 140 

Pro Leu Arg Ala Glu Leu Gin Glu Gly Ala Arg Gin Lys Leu His Glu 
145 150 155 160 

Leu Gin Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala 

165 170 175 

Arg Ala His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp 
180 185 190 

Glu Leu Arg Gin Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn 
195 200 205 

Gly Gly Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu 
210 215 220 

Ser Thr Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gin 
225 230 235 240 

Gly Leu Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala 
245 250 255 

Leu Glu Glu Tyr Thr Lys Lys Leu Asn Thr Gin 
260 265 


<210> SEQ ID NO 31 

<211> LENGTH: 14121 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY: CDS 

<2 22> LOCATION: ( 12 9 ) . . . ( 13 820 ) 

<223> OTHER INFORMATION: nucleotide sequence encoding apolipoprotein B 
(APOB) 

<4 0 0> SEQUENCE: 31 

attcccaccg ggacctgcgg ggctgagtgc ccttctcggt tgctgccgct gaggagcccg 6 0 

cccagccagc cagggccgcg aggccgaggc caggccgcag cccaggagcc gccccaccgc 120 

agctggcg atg gac ccg ccg agg ccc gcg ctg ctg gcg ctg ctg gcg ctg 170 
Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu 
15 10 

cct gcg ctg ctg ctg ctg ctg ctg gcg ggc gcc agg gcc gaa gag gaa 218 
Pro Ala Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu 
15 20 25 30 

atg ctg gaa aat gtc age ctg gtc tgt cca aaa gat gcg acc cga ttc 26 6 

Met Leu Glu Asn Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe 
35 40 45 

aag cac etc egg aag tac aca tac aac tat gag get gag agt tec agt 314 
Lys His Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser 
50 55 60 

gga gtc cct ggg act get gat tea aga agt gcc acc agg ate aac tgc 362 
Gly Val Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg lie Asn Cys 
65 70 75 

aag gtt gag ctg gag gtt ccc cag etc tgc age ttc ate ctg aag acc 410 
Lys Val Glu Leu Glu Val Pro Gin Leu Cys Ser Phe lie Leu Lys Thr 


age cag tgc acc ctg aaa gag gtg tat ggc ttc aac cct gag ggc aaa 


458 
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-continued 

Ser Gin Cys Thr Leu Lye Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys 

95 100 105 110 

gcc ttg ctg aag aaa ace aag aac tct gag gag ttt get gca gec atg 

Ala Leu Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met 

115 120 125 

tec agg tat gag etc aag ctg gcc att cca gaa ggg aag cag gtt ttc 

Ser Arg Tyr Glu Leu Lys Leu Ala lie Pro Glu Gly Lys Gin Val Phe 

130 135 140 

ctt tac ccg gag aaa gat gaa cot act tac ate ctg aac ate aag agg 

Leu Tyr Pro Glu Lys Asp Glu Pra Thr Tyr lie Leu Asn lie Lys Arg 

145 150 155 

ggc ate att tct gcc etc ctg gtt ccc cca gag aca gaa gaa gcc aag 

Gly He lie Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys 

160 165 170 

caa gtg ttg ttt ctg gat acc gtg tat gga aac tgc tec act cac ttt 

Gin Val Leu Phe Leu Asp Thr Val Tyr Gly Abu Cys Ser Thr His Phe 

175 ISO 1S5 190 

acc gtc aag acg agg aag ggc aat gtg gca aca gaa ata tec act gaa 

Thr Val Lys Thr Arg Lys Gly Asn Val Ala Thr Glu He Ser Thr Glu 

195 200 205 

aga gac ctg ggg cag tgt gat cgc ttc aag ccc ate cgc aca ggc ate 

Arg Asp Leu Gly Gin Cys Asp Arg Phe Lys Pro Tie Arg Thr Gly Tie 

210 215 220 

age cca ctt get ate ate aaa ggc atg acc cgc ccc ttg tea act ctg 

Ser Pro Leu Ala Leu He Lys Gly Met Thr Arg Pro Leu Ser Thr Leu 

225 230 235 

ate age age age cag tec tgt cag tac aca ctg gac get aag agg aag 

lie Ser Ser Ser Gin Ser Cys Gin Tyr Thr Leu Asp Ala Lys Arg Lys 

240 245 250 

cat gtg gca gaa gcc ate tgc aag gag caa cac etc ttc ctg cct ttc 

His Val Ala Glu Ala He Cys Lys Glu Gin His Leu Phe Leu Pro Phe 

255 260 265 270 

tec tac aac aat aag tat ggg atg gta gca caa gtg aca cag act ttg 

Ser Tyr Asn Asn Lys Tyr Gly Met Val Ala Gin Val Thr Gin Thr Leu 

275 280 285 

aaa ctt gaa gac aca cca aag ate aae age cgc ttc ttt ggt gaa ggt 

Lys Leu Glu Asp Thr Pro Lys He Asn Ser Arg Phe Phe Gly Glu Gly 

290 295 300 

act aag aag atg ggc etc gca ttt gag age acc aaa tec aca tea cct 

Thr Lys Lys Met Gly Leu Ala Phe Glu Ser Thr Lys Ser Thr Ser Pro 

305 310 315 

cca aag cag gcc gaa get gtt ttg aag act etc cag gaa ctg aaa aaa 

Pro Lys Gin Ala Glu Ala Val Leu Lys Thr Leu Gin Glu Leu Lys Lys 

320 325 330 

eta acc ate tct gag caa aat ate cag aga get aat etc ttc aat aag 

Leu Thr lie Ser Glu Gin Asn He Gin Arg Ala Asn Leu Phe Asn Lys 

335 340 345 350 

ctg gtt act gag ctg aga ggc etc agt gat gaa gca gtc aca tct etc 

Leu Val Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu 

355 360 365 

ttg cca cag ctg att gag gtg tec age ccc ate act tta caa gcc ttg 

Leu Pro Gin Leu He Glu Val Ser Ser Pro He Thr Leu Gin Ala Leu 

370 375 380 

gtt cag tgt gga cag cct cag tgc tec act cac ate etc cag tgg ctg 

Val Gin Cys Gly Gin Pro Gin Cys Ser Thr His He Leu Gin Trp Leu 

385 390 395 

aaa cgt gtg cat gcc aac ccc ctt ctg ata gat gtg gtc acc tac ctg 
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-continued 

Lys Arg Val His Ala Asn Pro Leu Leu lie Asp Val Val Thr Tyr Leu 

400 405 410 

gtg gcc ctg ate ncc gag ccc tea gca cag cag ctg cga gag ate ttc 

Val Ala Leu lie Pro Glu Pro Ser Ala Gin Gin Leu Arg Glu lie Phe 

415 420 425 430 

aac atg gcg agg gat cag cgc age cga gcc acc ttg tat gcg ctg age 

Asn Met Ala Arg Asp Gin Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser 

435 440 445 

cac gcg gtc aac aac tat cat aag aca aac cct aca ggg acc cag gag 

His Ala Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gin Glu 

450 455 460 

ctg ctg gac att get aat tac ctg atg gaa cag att caa gat gac tgc 

Leu Leu Asp lie Ala Asn Tyr Leu Met Glu Gin lie Gin Asp Asp Cys 

465 470 475 

act ggg gat gaa gat tac acc tat ttg att ctg egg gtc att gga aat 

Thr Gly Asp Glu Asp Tyr Thr Tyr Leu lie Leu Arg Val lie Gly Asn 

4S0 485 490 

atg ggc caa acc atg gag cag tta act cca gaa etc aag tct tea ate 

Met Gly Gin Thr Met Glu Gin Leu Thr Pro Glu Leu Lys Ser Ser lie 

495 500 505 510 

etc aaa tgt gtc caa agt aca aag cca tea ctg atg ate cag aaa get 

Leu Lys Cys Val Gin Ser Thr Lys Pro Ser Leu Met lie Gin Lys Ala 

515 520 525 

gcc ate cag get ctg egg aaa atg gag cct aaa gac aag gac cag gag 

Ala lie Gin Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gin Glu 

530 535 540 

gtt ctt ctt cag act ttc ctt gat gat get tct ccg gga gat aag cga 

Val Leu Leu Gin Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg 

545 550 555 

ctg get gcc tat ctt atg ttg atg agg agt cct tea cag gca gat att 

Leu Ala Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gin Ala Asp lie 

560 565 570 

aac aaa att gtc caa att eta cca tgg gaa cag aat gag caa gtg aag 

Asn Lys lie Val Gin lie Leu Pro Trp Glu Gin Asn Glu Gin Val Lys 

575 580 585 590 

aac ttt gtg get tec cat att gcc aat ate ttg aac tea gaa gaa ttg 

Asn Phe Val Ala Ser His lie Ala Asn lie Leu Asn Ser Glu Glu Leu 

595 600 605 

gat ate caa gat ctg aaa aag tta gtg aaa gaa get ctg aaa gaa tct 

Asp He Gin Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser 

610 615 620 

caa ctt cca act gtc atg gac ttc aga aaa ttc tct egg aac tat caa 

Gin Leu Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gin 

625 630 635 

etc tac aaa tct gtt tct ctt cca tea ctt gac cca gcc tea gcc aaa 

Leu Tyr Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys 

640 645 650 

ata gaa ggg aat ctt ata ttt gat cca aat aac tac ctt cct aaa gaa 

He Glu Gly Asn Leu He Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu 

655 660 665 670 

age atg ctg aaa act aec etc act gcc ttt gga ttt get tea get gac 

Ser Met Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp 

675 680 635 

etc ate gag att ggc ttg gaa gga aaa ggc ttt gag cca aca ttg gaa 

Leu He Glu He Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu 

690 695 700 

get ctt ttt ggg aag caa gga ttt ttc cca gac agt gtc aac aaa get 
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-continued 

Ala Leu Phe Gly Lys Gin Gly Phe Phe Pro Asp Ser Val Asn Lys Ala 

705 710 715 

ttg tac tgg gtt aat ggt caa gtt cct gat ggt gtc tct aag gtc tta 

Leu Tyr Trp Val Asn Gly Gin Val Pro Asp Gly Val Ser Lys Val Leu 

720 725 730 

gtg gac cac ttt ggc tat ace aaa gat gat aaa cat gag cag gat atg 

Val Asp His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gin Asp Met 

735 740 745 750 

gta aat gga ata atg etc agt gtt gag aag ctg att aaa gat ttg aaa 

Val Asn Gly lie Met Leu Ser Val Glu Lys Leu lie Lys Asp Leu Lys 

755 760 765 

tec aaa gaa gtc ccg gaa gec aga gec tac etc cgc ate ttg gga gag 

Ser Lys Glu Val Pro Glu Ala Arg Ala Tyr Leu Arg lie Leu Gly Glu 

770 775 780 

gag ctt ggt ttt gec agt etc cat gac etc cag etc ctg gga aag ctg 

Glu Leu Gly Phe Ala Ser Leu His Asp Leu Gin Leu Leu Gly Lys Leu 

785 790 795 

ctt ctg atg ggt gec cgc act ctg cag ggg ate ccc cag atg att gga 

Leu Leu Met Gly Ala Arg Thr Leu Gin Gly lie Pro Gin Met lie Gly 

800 805 810 

gag gtc ate agg aag ggc tea aag aat gac ttt ttt ctt cac tac ate 

Glu Val He Arg Lys Gly Ser Lys Asn Asp Phe Phe Leu His Tyr Tie 

815 820 325 830 

ttc atg gag aat gec ttt gaa etc ece act gga get gga tta cag ttg 

Phe Met Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gin Leu 

835 840 845 

caa ata tct tea tct gga gtc att get ccc gga gec aag get gga gta 

Gin lie Ser Ser Ser Gly Val lie Ala Pro Gly Ala Lys Ala Gly Val 

850 855 860 

aaa ctg gaa gta gee aac atg cag get gaa ctg gtg gca aaa ccc tec 

Lys Leu Glu Val Ala Asn Met Gin Ala Glu Leu Val Ala Lys Pro Ser 

865 870 875 

gtg tct gtg gag ttt gtg aca aat atg ggc ate ate att ccg gac ttc 

Val Ser Val Glu Phe Val Thr Asn Met Gly He He lie Pro Asp Phe 

880 885 890 

get agg agt ggg gtc cag atg aac ace aac ttc ttc cac gag teg ggt 

Ala Arg Ser Gly Val Gin Met Asn Thr Asn Phe Phe His Glu Ser Gly 

895 900 905 910 

ctg gag get cat gtt gec eta aaa get ggg aag ctg aag ttt ate att 

Leu Glu Ala His Val Ala Leu Lys Ala Gly Lys Leu Lys Phe lie He 

915 920 925 

cct tec cca aag aga cca gtc aag ctg etc agt gga ggc aac aca tta 

Pro Ser Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu 

930 935 940 

cat ttg gtc tct ace acc aaa acg gag gtg ate cca cct etc att gag 

His Leu Val Ser Thr Thr Lys Thr Glu Val lie Pro Pro Leu lie Glu 

945 950 955 

aac agg cag tec tgg tea gtt tgc aag caa gtc ttt cct ggc ctg aat 

Asn Arg Gin Ser Trp Ser Val Cys Lys Gin Val Phe Pro Gly Leu Asn 

960 965 970 

tac tge acc tea ggc get tac tec aac gec age tec aca gac tec gec 

Tyr Cys Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala 

975 980 985 990 

tec tac tat ccg ctg acc ggg gac acc aga tta gag ctg gaa ctg agg 

Ser Tyr Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg 

995 1000 1005 

cct aca gga gag att gag cag tat tct gtc age gca acc tat gag etc 
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Pro Thr Gly Glu lie Glu Gin Tyr Ser Val Ser Ala Thr Tyr Glu Leu 
1010 1015 1020 

cag aga gag gac aga gee ttg gtg gat ace ctg aag ttt gta act caa 

Gin Arg Glu Asp Arg Ala Leu Val Asp Thr Leu Lys Phe Val Thr Gin 
1025 1030 1035 

gca gaa ggt gcg aag cag act gag get acc atg aca ttc aaa tat aat 

Ala Glu Gly Ala Lys Gin Thr Glu Ala Thr Met Thr Phe Lys Tyr Asn 

1040 1045 1050 

egg cag agt atg acc ttg tec agt gaa gtc caa att ccg gat ttt gat 

Arg Gin Ser Met Thr Leu Ser Ser Glu Val Gin lie Pro Asp Phe Asp 

1055 1060 1065 1070 

gtt gac etc gga aca ate etc aga gtt aat gat gaa tct act gag ggc 

Val Asp Leu Gly Thr He Leu Arg Val Asn Asp Glu Ser Thr Glu Gly 
1075 1080 1085 

aaa acg tct tac aga etc acc ctg gac att cag aac aag aaa att act 

Lye Thr Ser Tyr Arg Leu Thr Leu Asp He Gin Asn Lys Lys lie Thr 
1090 1095 1100 

gag gtc gec etc atg ggc cac eta agt tgt gac aca aag gaa gaa aga 

Glu Val Ala Leu Met Gly His Leu Ser Cys Asp Thr Lys Glu Glu Arg 
1105 1110 1115 

aaa ate aag ggt gtt att tec ata ccc cgt ttg caa gca gaa gee aga 

Lys He Lys Gly Val lie Ser Tie Pro Arg Leu Gin Ala Glu Ala Arg 

1120 1125 1130 

agt gag ate etc gee cac tgg teg act gee aaa ctg ctt etc caa atg 

Ser Glu He Leu Ala Hie Trp Ser Pro Ala Lys Leu Leu Leu Gin Met 

1135 1140 1145 1150 

gac tea tct get aca get tat ggc tec aca gtt tec aag agg gtg gca 

Asp Ser Ser Ala Thr Ala Tyr Gly Ser Thr Val Ser Lys Arg Val Ala 
1155 1160 1165 

tgg cat tat gat gaa gag aag att gaa ttt gaa tgg aac aca ggc acc 

Trp His Tyr Asp Glu Glu Lys He Glu Phe Glu Trp Asn Thr Gly Thr 
1170 1175 1180 

aat gta gat acc aaa aaa atg act tec aat ttc cct gtg gat etc tec 

Asn Val Asp Thr Lys Lys Met Thr Ser Asn Phe Pro Val Asp Leu Ser 
1185 1190 1195 

gat tat cct aag age ttg cat atg tat get aat aga etc ctg gat cac 

Asp Tyr Pro Lys Ser Leu His Met Tyr Ala Asn Arg Leu Leu Asp His 

1200 1205 1210 

aga. gtc cct gaa aca gac atg act ttc egg cac gtg ggt tec aaa tta 

Arg Val Pro Glu Thr Asp Met Thr Phe Arg His Val Gly Ser Lys Leu 

1215 1220 1225 1230 

ata gtt gca atg age tea tgg ctt cag aag gca tct ggg agt ctt cct 

lie Val Ala Met Ser Ser Trp Leu Gin Lys Ala Ser Gly Ser Leu Pro 
1235 1240 1245 

tat acc cag act ttg caa gac cac etc aat age ctg aag gag ttc aac 

Tyr Thr Gin Thr Leu Gin Asp His Leu Asn Ser Leu Lys Glu Phe Asn 
1250 1255 1260 

etc cag aac atg gga ttg cca gac ttc cac ate cca gaa aac etc ttc 

Leu Gin Asn Met Gly Leu Pro Asp Phe His He Pro Glu Asn Leu Phe 
1265 1270 1275 

tta aaa age gat ggc egg gtc aaa tat ace ttg aac aag aac agt ttg 

Leu Lys Ser Asp Gly Arg Val Lys Tyr Thr Leu Asn Lys Asn Ser Leu 

1230 1285 1290 

aaa att gag att cct ttg cct ttt ggt ggc aaa tec tec aga gat eta 

Lys He Glu He Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg Asp Leu 

1295 1300 1305 1310 

aag atg tta gag act gtt agg aca cca gee etc cac ttc aag tct gtg 
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-continued 

Lys Met Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe Lys Ser Val 

1315 1320 1325 

gga ttc cat ctg cca tct cga gag ttc caa gtc cct act ttt ace att 

Gly Phe His Leu Pro Ser Arg Glu Phe Gin Val Pro Thr Phe Thr lie 

1330 1335 1340 

ccc aag ttg tat caa ctg caa gtg cct etc ctg ggt gtt eta gac etc 

Pro Lys Leu Tyr Gin Leu Gin Val Pro Leu Leu Gly Val Leu Asp Leu 

1345 1350 1355 

tec acg aat gtc tac age aac ttg tac aac tgg tec gee tec tac agt 

Ser Thr Asn Val Tyr Ser Asn Leu Tyr Asn Trp Ser Ala Ser Tyr Ser 

1360 1365 1370 

ggt ggc aac ace age aca gac cat ttc age ctt egg get cgt tac cac 

Gly Gly Asn Thr Ser Thr Asp His Phe Ser Leu Arg Ala Arg Tyr His 

1375 1380 1385 1390 

atg aag get gac tct gtg gtt gac ctg ctt tec tac aat gtg caa gga 

Met Lys Ala Asp Ser Val Val Asp Leu Leu Ser Tyr Asn Val Gin Gly 

1395 1400 1405 

tct gga gaa aca aca tat gac cac aag aat acg ttc aca eta tea tgt 

Ser Gly Glu Thr Thr Tyr Asp His Lys Asn Thr Phe Thr Leu Ser Cys 

1410 1415 1420 

gat ggg tct eta cgc cac aaa ttt eta gat teg aat ate aaa ttc agt 

Asp Gly Ser Leu Arg His Lys Phe Leu Asp Ser Asn lie Lys Phe Ser 

1425 1430 1435 

cat gta gaa aaa ctt gga aac aac cca gtc tea aaa ggt tta eta ata 

His Val Glu Lys Leu Gly Asn Asn Pro Val Ser Lys Gly Leu Leu lie 

1440 1445 1450 

ttc gat gca tct agt tec tgg gga cca cag atg tct get tea gtt cat 

Phe Asp Ala Ser Ser Ser Trp Gly Pro Gin Met Ser Ala Ser Val His 

1455 1460 1465 1470 

ttg gac tec aaa aag aaa cag cat ttg ttt gtc aaa gaa gtc aag att 

Leu Asp Ser Lys Lys Lys Gin His Leu Phe Val Lys Glu Val Lys lie 

1475 1480 1485 

gat ggg cag ttc aga gtc tct teg ttc tat get aaa ggc aca tat ggc 

Asp Gly Gin Phe Arg Val Ser Ser Phe Tyr Ala Lys Gly Thr Tyr Gly 

1490 1495 1500 

ctg tct tgt cag agg gat cct aac act ggc egg etc aat gga gag tec 

Leu Ser Cys Gin Arg Asp Pro Asn Thr Gly Arg Leu Asn Gly Glu Ser 

1505 1510 1515 

aac ctg agg ttt aac tec tec tac etc caa ggc acc aac cag ata aca 

Asn Leu Arg Phe Asn Ser Ser Tyr Leu Gin Gly Thr Asn Gin lie Thr 

1520 1525 1530 

gga aga tat gaa gat gga acc etc tec etc acc tec acc tct gat ctg 

Gly Arg Tyr Glu Asp Gly Thr Leu Ser Leu Thr Ser Thr Ser Asp Leu 

1535 1540 1545 1550 

caa agt ggc ate att aaa aat act get tec eta aag tat gag aac tac 

Gin Ser Gly lie lie Lys Asn Thr Ala Ser Leu Lys Tyr Glu Asn Tyr 

1555 1560 1565 

gag ctg act tta aaa tct gac acc aat ggg aag tat aag aac ttt gee 

Glu Leu Thr Leu Lys Ser Asp Thr Asn Gly Lys Tyr Lys Asn Phe Ala 

1570 1575 1580 

act tct aac aag atg gat atg acc ttc tct aag caa aat gca ctg ctg 

Thr Ser Asn Lys Met Asp Met Thr Phe Ser Lys Gin Asn Ala Leu Leu 

1585 1590 1595 

cgt tct gaa tat cag get gat tac gag tea ttg agg ttc ttc age ctg 

Arg Ser Glu Tyr Gin Ala Asp Tyr Glu Ser Leu Arg Phe Phe Ser Leu 

1600 1605 1610 

ctt tct gga tea eta aat tec cat ggt ctt gag tta aat get gac ate 
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Leu Ser Gly Ser Leu Asn Ser His Gly Leu Glu Leu Asn Ala Asp lie 

1615 1620 1625 1630 

tta ggc act gac aaa att aat agt ggt get cac aag gcg aca eta agg 

Leu Gly Thr Asp Lys lie Asn Ser Gly Ala His Lys Ala Thr Leu Arg 
1635 1640 1645 

att ggc caa gat gga ata tct acc agt gca acg acc aac ttg aag tgt 

lie Gly Gin Asp Gly lie Ser Thr Ser Ala Thr Thr Asn Leu Lys Cys 
1650 1655 1660 

agt etc ctg gtg ctg gag aat gag ctg aat gca gag ctt ggc etc tct 

Ser Leu Leu Val Leu Glu Asn. Glu Leu Asn Ala Glu Leu Gly Leu Ser 
1665 1670 1675 

ggg gca tct atg aaa tta aca aca aat ggc cgc ttc agg gaa cac aat 

Gly Ala Ser Met Lys Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn 

16B0 1685 1690 

gca aaa ttc agt ctg gat ggg aaa gec gec etc aca gag eta tea ctg 

Ala Lys Phe Ser Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu 

1695 1700 1705 1710 

gga agt get tat cag gec atg att ctg ggt gtc gac age aaa aac att 

Gly Ser Ala Tyr Gin Ala Met lie Leu Gly Val Asp Ser Lys Asn lie 
1715 1720 1725 

ttc aac ttc aag gtc agt caa gaa gga ctt aag etc tea aat gac atg 

Phe Asn Phe Lys Val Ser Gin Glu Gly Leu Lys Leu Ser Asn Asp Met 
1730 1735 1740 

atg ggc tea tat get gaa atg aaa ttt gac cac aca aac agt ctg aac 

Met Gly Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn 
1745 1750 1755 

att gca ggc tta tea ctg gac ttc tct tea aaa ctt gac aac att tac 

lie Ala Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn lie Tyr 

1760 1765 1770 

age tct gac aag ttt tat aag caa act gtt aat tta cag eta cag ccc 

Ser Ser Asp Lys Phe Tyr Lys Gin Thr Val Asn Leu Gin Leu Gin Pro 

1775 1780 1785 1790 

tat tct ctg gta act act tta aac agt gac ctg aaa tac aat get ctg 

Tyr Ser Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr Asn Ala Leu 
1795 1800 1805 

gat etc acc aac aat ggg aaa eta egg eta gaa ccc ctg aag ctg cat 

Asp Leu Thr Asn Asn Gly Lys Leu Arg Leu Glu Pre- Leu Lys Leu His 
1810 1815 1820 

gtg get ggt aac eta aaa gga gec tac caa aat aat gaa ata aaa cac 

Val Ala Gly Asn Leu Lys Gly Ala Tyr Gin Asn Asn Glu lie Lys His 
1825 1830 1835 

ate tat gee ate tct tct get gee tta tea gca age tat aaa gca gac 

lie Tyr Ala lie Ser Ser Ala Ala Leu Ser Ala Ser Tyr Lys Ala Asp 

1840 1845 1850 

act gtt get aag gtt cag ggt gtg gag ttt age cat egg etc aac aca 

Thr Val Ala Lys Val Gin Gly Val Glu Phe Ser His Arg Leu Asn Thr 

1855 I860 1865 1870 

gac ate get ggg ctg get tea gee att gac atg age aca aac tat aat 

Asp lie Ala Gly Leu Ala Ser Ala lie Asp Met Ser Thr Asn Tyr Asn 
1875 1880 1885 

tea gae tea ctg cat ttc age aat gtc ttc cgt tct gta atg gec ccg 

Ser Asp Ser Leu His Phe Ser Asn Val Phe Arg Ser Val Met Ala Pro 
1890 1895 1900 

ttt acc atg acc ate gat gca cat aca aat ggc aat ggg aaa etc get 

Phe Thr Met Thr lie Asp Ala His Thr Asn Gly Asn Gly Lys Leu Ala 
1905 1910 1915 

etc tgg gga gaa cat act ggg cag ctg tat age aaa ttc ctg ttg aaa 
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Leu Trp Gly Glu His Thr Gly Gin Leu Tyr Ser Lys Phe Leu Leu Lys 

1920 1925 1930 

gca gaa cct etg gca ttt act tte tct cat gat tao aaa ggc tec aca 

Ala Glu Pro Leu Ala Phe Thr Phe Ser His Asp Tyr Lys Gly Ser Thr 

1935 1940 1945 1950 

agt cat cat etc gtg tct agg aaa age ate agt gca get ctt gaa cac 

Ser His His Leu Val Ser Arg Lys Ser lie Ser Ala Ala Leu Glu His 

1955 I960 1965 

aaa gtc agt gee ctg ctt act cca get gag cag aca ggc acc tgg aaa 

Lys Val Ser Ala Leu Leu Thr Pra Ala Glu Gin Thr Gly Thr Trp Lys 

1970 1975 1980 

etc aag acc caa ttt aac aac aat gaa tac age cag gac ttg gat get 

Leu Lys Thr Gin Phe Asn Asn Asn Glu Tyr Ser Gin Asp Leu Asp Ala 

1985 1990 1995 

tac aac act aaa gat aaa att ggc gtg gag ctt act gga cga act ctg 

Tyr Aen Thr Lys Asp Lys lie Gly Val Glu Leu Thr Gly Arg Thr Leu 

2000 2005 2010 

get gac eta act eta eta gac tec cca att aaa gtg cca ctt tta etc 

Ala Asp Leu Thr Leu Leu Asp Ser Pro lie Lys Val Pro Leu Leu Leu 

2015 2020 2025 2030 

agt gag ccc ate aat ate att gat get tta gag atg aga gat gec gtt 

Ser Glu Pro lie Asn He lie Asp Ala Leu Glu Met Arg Asp Ala Val 

2035 2040 2045 

gag aag ccc caa gaa ttt aca att gtt get ttt gta aag tat gat aaa 

Glu Lys Pro Gin Glu Phe Thr He Val Ala Phe Val Lys Tyr Asp Lys 

2050 2055 2060 

aac caa gat gtt cac tec att aac etc cca ttt ttt gag acc ttg caa 

Asn Gin Asp Val His Ser He Asn Leu Pro Phe Phe Glu Thr Leu Gin 

2065 2070 2075 

gaa tat ttt gag agg aat cga caa acc att ata gtt gta gtg gaa aac 

Glu Tyr Phe Glu Arg Asn Arg Gin Thr He He Val Val Val Glu Asn 

2080 2085 2090 

gta cag aga aac ctg aag cac ate aat att gat caa ttt gta aga aaa 

Val Gin Arg Asn Leu Lys His lie Asn lie Asp Gin Phe Val Arg Lys 

2095 2100 2105 2110 

tac aga gca gee ctg gga aaa etc cca cag caa get aat gat tat ctg 

Tyr Arg Ala Ala Leu Gly Lys Leu Pro Gin Gin Ala Asn Asp Tyr Leu 

2115 2120 2125 

aat tea ttc aat tgg gag aga caa gtt tea cat gec aag gag aaa etg 

Asn Ser Phe Asn Trp Glu Arg Gin Val Ser His Ala Lys Glu Lys Leu 

2130 2135 2140 

act get etc aca aaa aag tat aga att aca gaa aat gat ata caa att 

Thr Ala Leu Thr Lys Lys Tyr Arg He Thr Glu Asn Asp He Gin He 

2145 2150 2155 

gca tta gat gat gee aaa ate aac ttt aat gaa aaa eta tct caa ctg 

Ala Leu Asp Asp Ala Lys lie Asn Phe Asn Glu Lys Leu Ser Gin Leu 

2160 2165 2170 

cag aca tat atg ata caa ttt gat cag tat att aaa gat agt tat gat 

Gin Thr Tyr Met lie Gin Phe Asp Gin Tyr lie Lys Asp Ser Tyr Asp 

2175 2180 2185 2190 

tta eat gat ttg aaa ata get att get aat att att gat gaa ate att 

Leu His Asp Leu Lys He Ala He Ala Asn He He Asp Glu He He 

2195 2200 2205 

gaa aaa tta aaa agt ctt gat gag cac tat cat ate cgt gta aat tta 

Glu Lys Leu Lys Ser Leu Asp Glu His Tyr His He Arg Val Asn Leu 

2210 2215 2220 

gta aaa aca ate cat gat eta cat ttg ttt att gaa aat att gat ttt 
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Val Lys Thr lie His Asp Leu His Leu Phe lie Glu Asn lie Asp Phe 
2225 2230 2235 

aac aaa agt gga agt agt act gca tec tgg att caa aat gtg gat act 

Asn Lys Eer Gly Ser Ser Thr Ala Ser Trp lie Gin Asn Val Asp Thr 

2240 2245 2250 

aag tac caa ate aga ate cag ata caa gaa aaa ctg cag cag ctt aag 

Lys Tyr Gin He Arg He Gin He Gin Glu Lys Leu Gin Gin Leu Lys 

2255 2260 2265 2270 

aga cac ata cag aat ata gac ate cag cac eta get gga aag tta aaa 

Arg His lie Gin Asn lie Asp lie Gin His Leu Ala Gly Lys Leu Lys 
2275 2280 2285 

caa cac att gag get att gat gtt aga gtg ctt tta gat caa ttg gga 

Gin His lie Glu Ala lie Asp Val Arg Val Leu Leu Asp Gin Leu Gly 
2290 2295 2300 

act aca att tea ttt gaa aga ata aat gat gtt ctt gag cat gtc aaa 

Thr Thr lie Ser Phe Glu Arg He Asn Asp Val Leu Glu His Val Lys 
2305 2310 2315 

cac ttt gtt ata aat ctt att ggg gat ttt gaa gta get gag aaa ate 

His Phe Val He Asn Leu lie Gly Asp Phe Glu Val Ala Glu Lys He 

2320 2325 2330 

aat gec ttc aga gec aaa gtc cat gag tta ate gag agg tat gaa gta 

Asn Ala Phe Arg Ala Lys Val His Glu Leu lie Glu Arg Tyr Glu Val 

2335 2340 2345 2350 

gac caa caa ate cag gtt tta atg gat aaa tta gta gag ttg ace cac 

Asp Gin Gin He Gin Val Leu Met Asp Lys Leu Val Glu Leu Thr His 
2355 2360 2365 

caa tac aag ttg aag gag act att cag aag eta age aat gtc eta caa 

Gin Tyr Lys Leu Lys Glu Thr He Gin Lys Leu Ser Asn Val Leu Gin 
2370 2375 2380 

caa gtt aag ata aaa gat tac ttt gag aaa ttg gtt gga ttt att gat 

Gin Val Lys He Lys Asp Tyr Phe Glu Lys Leu Val Gly Phe He Asp 
2385 2390 2395 

gat get gtg aag aag ctt aat gaa tta tct ttt aaa aca ttc att gaa 

Asp Ala Val Lys Lys Leu Asn. Glu Leu Ser Phe Lys Thr Phe lie Glu 

2400 2405 2410 

gat gtt aac aaa ttc ctt gae atg ttg ata aag aaa tta aag tea ttt 

Asp Val Asn Lys Phe Leu Asp Met Leu lie Lys Lys Leu Lys Ser Phe 

2415 2420 2425 2430 

gat tac cac cag ttt gta gat gaa acc aat gac aaa ate cgt gag gtg 

Asp Tyr His Gin Phe Val Asp Glu Thr Asn Asp Lys He Arg Glu Val 
2435 2440 2445 

act cag aga etc aat ggt gaa att cag get ctg gaa eta cca caa aaa 

Thr Gin Arg Leu Asn Gly Glu He Gin Ala Leu Glu Leu Pro Gin Lys 
2450 2455 2460 

get gaa gca tta aaa ctg ttt tta gag gaa acc aag gec aca gtt gca 

Ala Glu Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala 
2465 2470 2475 

gtg tat ctg gaa age eta cag gac acc aaa ata acc tta ate ate aat 

Val Tyr Leu Glu Ser Leu Gin Asp Thr Lys lie Thr Leu He lie Asn 

24S0 2485 2490 

tgg tta cag gag get tta agt tea gca tet ttg get cac atg aag gec 

Trp Leu Gin Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met Lys Ala 

2495 2500 2505 2510 

aaa ttc cga gag act eta gaa gat aca cga gac cga atg tat caa atg 

Lys Phe Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met Tyr Gin Met 
2515 2520 2525 

gac att cag cag gaa ctt caa cga tac ctg tct ctg gta ggc cag gtt 
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Asp lie Gin Gin Glu Leu Gin Arg Tyr Leu Ser Leu Val Gly Gin Val 

2530 2535 2540 

tat age aca ctt gtc ace tac att tct gat tgg tgg act ctt get get 

Tyr Ser Thr Leu Val Thr Tyr lie Ser Asp Trp Trp Thr Leu Ala Ala 

2545 2550 2555 

aag aac ctt act gac ttt gca gag caa tat tct ate caa gat tgg get 

Lys Asn Leu Thr Asp Phe Ala Glu Gin Tyr Ser lie Gin Asp Trp Ala 

2560 2565 2570 

aaa cgt atg aaa gca ttg gta gag caa ggg ttc act gtt cct gaa ate 

Lys Arg Met Lys Ala Leu Val Glu Gin Gly Phe Thr Val Pro Glu lie 

2575 2580 2585 2590 

aag ace ate ctt ggg ace atg cct gec ttt gaa gtc agt ctt cag get 

Lys Thr lie Leu Gly Thr Met Pro Ala Phe Glu Val Ser Leu Gin Ala 
2595 2600 2605 

ctt cag aaa get acc ttc cag aea cct gat ttt ata gtc ccc eta aca 

Leu Gin Lys Ala Thr Phe Gin Thr Pro Asp Phe lie Val Pro Leu Thr 

2610 2615 2620 

gat ttg agg att cca tea gtt cag ata aac ttc aaa gac tta aaa aat 

Asp Leu Arg lie Pro Ser Val Gin lie Asn Phe Lys Asp Leu Lys Asn 

2625 2630 2635 

ata aaa ate cca tec agg ttt tec aca cca gaa ttt acc ate ctt aac 

Tie Lys Tie Pro Ser Arg Phe Ser Thr Pro Glu Phe Thr Tie Leu Asn 

2640 2645 2650 

acc ttc cac att act tec ttt aca att gac ttt gtc gaa atg aaa gta 

Thr Phe His lie Pro Ser Phe Thr lie Asp Phe Val Glu Met Lys Val 

2655 2660 2665 2670 

aag ate ate aga acc att gac cag atg cag aac agt gag ctg cag tgg 

Lys He lie Arg Thr He Asp Gin Met Gin Asn Ser Glu Leu Gin Trp 
2675 2680 2685 

ccc gtt cca gat ata tat etc agg gat ctg aag gtg gag gac att cct 

Pro Val Pro Asp He Tyr Leu Arg Asp Leu Lys Val Glu Asp He Pro 

2690 2695 2700 

eta gcg aga ate acc ctg cca gac ttc cgt tta cca gaa ate gca att 

Leu Ala Arg lie Thr Leu Pro Asp Phe Arg Leu Pro Glu lie Ala lie 

2705 2710 2715 

cea gaa ttc ata ate cca act etc aac ctt aat gat ttt caa gtt cct 

Pro Glu Phe He lie Pro Thr Leu Asn Leu Asn Asp Phe Gin Val Pro 

2720 2725 2730 

gac ctt cac ata cca gaa ttc cag ctt ccc cac ate tea cac aca att 

Asp Leu His He Pro Glu Phe Gin Leu Pro His He Ser His Thr He 

2735 2740 2745 2750 

gaa gta cct act ttt ggc aag eta tac agt att ctg aaa ate caa tct 

Glu Val Pro Thr Phe Gly Lys Leu Tyr Ser He Leu Lys He Gin Ser 
2755 2760 2765 

cct ctt ttc aca tta gat gca aat get gac ata ggg aat gga acc acc 

Pro Leu Phe Thr Leu Asp Ala Asn Ala Asp He Gly Asn Gly Thr Thr 

2770 2775 2780 

tea gca aac gaa gca ggt ate gca get tec ate act gec aaa gga gag 

Ser Ala Asn Glu Ala Gly lie Ala Ala Ser He Thr Ala Lys Gly Glu 

2785 2790 2795 

tec aaa tta gaa gtt etc aat ttt gat ttt caa gca aat gca caa etc 

Ser Lys Leu Glu Val Leu Asn Phe Asp Phe Gin Ala Asn Ala Gin Leu 

2800 2805 2810 

tea aac cct aag att aat ccg ctg get ctg aag gag tea gtg aag ttc 

Ser Asn Pro Lys He Asn Pro Leu Ala Leu Lys Glu Ser Val Lys Phe 

2815 2820 2825 2830 

tec age aag tac ctg aga acg gag cat ggg agt gaa atg ctg ttt ttt 
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Ser Ser Lys Tyr Leu Arg Thr Glu His Gly Ser Glu Met Leu Phe Phe 
2835 2840 2S45 

gga aat get att gag gga aaa tea aac aca gtg gca agt tta cac aca 

Gly Asn Ala lie Glu Gly Lys Ser Asn Thr Val Ala Ser Leu His Thr 

2850 2855 2860 

gaa aaa aat aca ctg gag ctt agt aat gga gtg att gtc aag ata aac 

Glu Lys Asn Thr Leu Glu Leu Ser Asn Gly Val lie Val Lys lie Asn 
2865 2870 2875 

aat cag ctt acc ctg gat age aac act aaa tac ttc cac aaa ttg aac 

Asn Gin Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn 

2880 2885 2890 

ate ccc aaa ctg gac ttc tct agt cag get gac ctg cgc aac gag ate 

lie Pro Lys Leu Asp Phe Ser Ser Gin Ala Asp Leu Arg Asn Glu lie 

2895 2900 2905 2910 

aag aca ctg ttg aaa get ggc cac ata gca tgg act tct tct gga aaa 

Lys Thr Leu Leu Lys Ala Gly His lie Ala Trp Thr Ser Ser Gly Lys 
2915 2920 2925 

ggg tea tgg aaa tgg gec tgc ccc aga ttc tea gat gag gga aca cat 

Gly Ser Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His 

2930 2935 2940 

gaa tea caa att agt ttc acc ata gaa gga ccc etc act tec ttt gga 

Glu Ser Gin lie Ser Phe Thr Tie Glu Gly Pro Leu Thr Ser Phe Gly 
2945 2950 2955 

ctg tac aat aag ate aat age aaa aac eta aga gta aac aaa aac ttg 

Leu Ser Asn Lys lie Asn Ser Lys His Leu Arg Val Asn Gin Asn Leu 

2960 2965 2970 

gtt tat gaa tct ggc tec etc aac ttt tct aaa ctt gaa att caa tea 

Val Tyr Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu lie Gin Ser 

2975 2980 2985 2990 

caa gtc gat tec cag cat gtg ggc cac agt gtt eta act get aaa ggc 

Gin Val Asp Ser Gin His Val Gly His Ser Val Leu Thr Ala Lys Gly 
2995 3000 3005 

atg gca ctg ttt gga gaa ggg aag gca gag ttt act ggg agg cat gat 

Met Ala Leu Phe Gly Glu Gly Lys Ala Glu Phe Thr Gly Arg His Asp 

3010 3015 3020 

get cat tta aat gga aag gtt att gga act ttg aaa aat tct ctt ttc 

Ala His Leu Asn Gly Lye Val lie Gly Thr Leu Lys Asn Ser Leu Phe 
3G25 3030 3035 

ttt tea gec cag cca ttt gag ate aeg gca tee aca aac aat gaa ggg 

Phe Ser Ala Gin Pro Phe Glu lie Thr Ala Ser Thr Aen Asn Glu Gly 

3040 3045 3050 

aat ttg aaa gtt cgt ttt cca tta agg tta aca ggg aag ata gac ttc 

Asn Leu Lys Val Arg Phe Pro Leu Arg Leu Thr Gly Lys lie Asp Phe 

3055 3060 3065 3070 

ctg aat aac tat gca ctg ttt ctg agt ccc agt gec cag caa gca agt 

Leu Asn Asn Tyr Ala Leu Phe Leu Ser Pro Ser Ala Gin Gin Ala Ser 
3075 3080 3085 

tgg caa gta agt get agg ttc aat cag tat aag tac aac caa aat ttc 

Trp Gin Val Ser Ala Arg Phe Asn Gin Tyr Lys Tyr Asn Gin Asn Phe 

3090 3095 3100 

tct get gga aac aac gag aac att atg gag gec cat gta gga ata aat 

Ser Ala Gly Asn Asn Glu Asn lie Met Glu Ala His Val Gly lie Asn 
3105 3110 3115 

gga gaa gca aat ctg gat ttc tta aac att cct tta aca att cct gaa 

Gly Glu Ala Asn Leu Asp Phe Leu Asn lie Pro Leu Thr lie Pro Glu 

3120 3125 3130 

atg cgt eta cct tac aca ata ate aca act cct cca ctg aaa gat ttc 
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Met Arg Leu Pro Tyr Thr lie lie Thr Thr Pro Pro Leu Lys Asp Phe 

3135 3140 3145 3150 

tct eta tgg gaa aaa aca ggc ttg aag gaa ttc ttg aaa acg aca aag 

Ser Leu Trp Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys 

3155 3160 3165 

caa tea ttt gat tta agt gta aaa get cag tat aag aaa aac aaa cac 

Gin Ser Phe Asp Leu Ser Val Lys Ala Gin Tyr Lys Lys Asn Lys His 

3170 3175 3180 

agg cat tec ate aca aat cct ttg get gtg ctt tgt gag ttt ate agt 

Arg His Ser lie Thr Asn Pro Leu Ala Val Leu Cys Glu Phe lie Ser 

3185 3190 3195 

cag age ate aaa tee ttt gac agg cat ttt gaa aaa aac aga aac aat 

Gin Ser lie Lys Ser Phe Asp Arg His Phe Glu Lys Asn Arg Asn Asn 

3200 3205 3210 

gca tta gat ttt gtc acc aaa tec tat aat gaa aca aaa att aag ttt 

Ala Leu Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys lie Lys Phe 

3215 3220 3225 3230 

gat aag tac aaa get gaa aaa tct cac gac gag etc ccc agg acc ttt 

Asp Lys Tyr Lys Ala Glu Lys Ser His Asp Glu Leu Pro Arg Thr Phe 

3235 3240 3245 

caa att cct gga tac act gtt cca gtt gtc aat gtt gaa gtg tct cca 

Gin He Pro Gly Tyr Thr Val Pro Val Val Asn Val Glu Val Ser Pro 

3250 3255 3260 

ttc acc ata gag atg teg gca ttc ggc tat gtg ttc cca aaa gca gtc 

Phe Thr lie Glu Met Ser Ala Phe Gly Tyr Val Phe Pro Lys Ala Val 

3265 3270 3275 

age atg cct agt ttc tec ate eta ggt tct gac gtc cgt gtg cct tea 

Ser Met Pro Ser Phe Ser lie Leu Gly Ser Asp Val Arg Val Pro Ser 

3230 3285 3290 

tac aca tta ate ctg cca tea tta gag ctg cca gtc ctt cat gtc cct 

Tyr Thr Leu He Leu Pro Ser Leu Glu Leu Pro Val Leu His Val Pro 

3295 3300 3305 3310 

aga aat etc aag ctt tct ctt cca cat ttc aag gaa ttg tgt acc ata 

Arg Asn Leu Lys Leu Ser Leu Pro His Phe Lys Glu Leu Cys Thr lie 

3315 3320 3325 

age cat att ttt att cct gee atg ggc aat att acc tat gat ttc tec 

Ser His lie Phe lie Pro Ala Met Gly Asn He Thr Tyr Asp Phe Ser 

3330 3335 3340 

ttt aaa tea agt gtc ate aca ctg aat acc aat get gaa ctt ttt aac 

Phe Lys Ser Ser Val lie Thr Leu Asn Thr Asn Ala Glu Leu Phe Asn 

3345 3350 3355 

cag tea gat att gtt get cat etc ctt tct tea tct tea tct gtc att 

Gin Ser Asp He Val Ala His Leu Leu Ser Ser Ser Ser Ser Val He 

3360 3365 3370 

gat gca ctg cag tac aaa tta gag ggc acc aca aga ttg aca aga aaa 

Asp Ala Leu Gin Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys 

3375 3380 3385 3390 

a Sra 99 a ttg aag tta gec aca get ctg tct ctg age aac aaa ttt gtg 

Arg Gly Leu Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val 

3395 3400 3405 

gag ggt agt cat aac agt act gtg age tta acc acg aaa aat atg gaa 

Glu Gly Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu 

3410 3415 3420 

gtg tea gtg gca aaa acc aca aaa gee gaa att cca att ttg aga atg 

Val Ser Val Ala Lys Thr Thr Lys Ala Glu He Pro He Leu Arg Met 

3425 3430 3435 

aat ttc aag caa gaa ctt aat gga aat acc aag tea aaa cct act gtc 
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Asn Phe Lys Gin Glu Leu Abu Gly Asn Thr Lys Ser Lys Pro Thr Val 

3440 3445 3450 

tct tec tec atg gaa ttt aag tat gat ttc aat tct tea atg ctg tac 

Ser Ser Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met Leu Tyr 

3455 3460 3465 3470 

tct acc get aaa gga gca gtt gac cac aag ctt age ttg gaa age etc 

Ser Thr Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu Glu Ser Leu 

3475 3480 34S5 

acc tct tac ttt tec att gag tea tct acc aaa gga gat gtc aag ggt 

Thr Ser Tyr Phe Ser lie Glu Ser Ser Thr Lys Gly Asp Val Lys Gly 

3490 3495 3500 

teg gtt ctt tct egg gaa tat tea gga act att get agt gag gec aac 

Ser Val Leu Ser Arg Glu Tyr Ser Gly Thr He Ala Ser Glu Ala Asn 

3505 3510 3515 

act tac ttg aat tec aag age aca egg tct tea gtg aag ctg cag ggc 

Thr Tyr Leu Asn Ser Lys Ser Thr Arg Ser Ser Val Lys Leu Gin Gly 

3520 3525 3530 

act tec aaa att gat gat ate tgg aac ctt gaa gta aaa gaa aat ttt 

Thr Ser Lys He Asp Asp lie Trp Asn Leu Glu Val Lys Glu Asn Phe 

3535 3540 3545 3550 

get gga gaa gee aca etc caa cgc ata tat tec etc tgg gag cac agt 

Ala Gly Glu Ala Thr Leu Gin Arg Tie Tyr Ser Leu Trp Glu His Ser 

3555 3560 3565 

acg aaa aac cac tta cag eta gag gga etc ttt ttc ace aac gga gaa 

Thr Lys Asn His Leu Gin Leu Glu Gly Leu Phe Phe Thr Asn Gly Glu 

3570 3575 3580 

cat aca age aaa gee acc ctg gaa etc tct cca tgg caa atg tea get 

His Thr Ser Lys Ala Thr Leu Glu Leu Ser Pro Trp Gin Met Ser Ala 

3585 3590 3595 

ctt gtt cag gtc cat gca agt cag ccc agt tec ttc cat gat ttc cct 

Leu Val Gin Val His Ala Ser Gin Pro Ser Ser Phe His Asp Phe Pro 

3600 3605 3610 

gac ctt ggc cag gaa gtg gec ctg aat get aac act aag aac cag aag 

Asp Leu Gly Gin Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gin Lys 

3615 3620 3S25 3630 

ate aga tgg aaa aat gaa gtc egg att cat tct ggg tct ttc cag age 

lie Arg Trp Lys Asn Glu Val Arg He His Ser Gly Ser Phe Gin Ser 

3635 3640 3645 

cag gtc gag ctt tec aat gac caa gaa aag gca cac ctt gac att gca 

Gin Val Glu Leu Ser Asn Asp Gin Glu Lys Ala His Leu Asp lie Ala 

3650 3655 3660 

gga tec tta gaa gga cac eta agg ttc etc aaa aat ate ate eta cca 

Gly Ser Leu Glu Gly His Leu Arg Phe Leu Lys Asn He He Leu Pro 

3665 3670 3675 

gtc tat gac aag age tta tgg gat ttc eta aag ctg gat gta acc acc 

Val Tyr Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr Thr 

3680 3685 3690 

age att ggt agg aga cag cat ctt cgt gtt tea act gec ttt gtg tac 

Ser He Gly Arg Arg Gin His Leu Arg Val Ser Thr Ala Phe Val Tyr 

3695 3700 3705 3710 

acc aaa aac ccc aat ggc tat tea ttc tec ate cct gta aaa gtt ttg 

Thr Lys Asn Pro Asn Gly Tyr Ser Phe Ser He Pro Val Lys Val Leu 

3715 3720 3725 

get gat aaa ttc att act cct ggg ctg aaa eta aat gat eta aat tea 

Ala Asp Lys Phe He Thr Pro Gly Leu Lys Leu Asn Asp Leu Asn Ser 

3730 3735 3740 

gtt ctt gtc atg cct acg ttc cat gtc cca ttt aca gat ctt cag gtt 
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Val Leu Val Met Pro Thr Phe His Val Pro Phe Thr Asp Leu Gin Val 

3745 3750 3755 

cca teg tgc aaa ctt gac ttc aga gaa ata caa ate tat aag aag ctg 

Pro Ser Cys Lys Leu Asp Phe Arg Glu lie Gin lie Tyr Lys Lys Leu 

3760 3765 3770 

aga act tea tea ttt gec etc aac eta cca aca etc ccc gag gta aaa 

Arg Thr Ser Ser Phe Ala Leu Asn Leu Pro Thr Leu Pro Glu Val Lys 

3775 3780 3785 3790 

ttc cct gaa gtt gat gtg tta aca aaa tat tct caa cca gaa gac tec 

Phe Pro Glu Val Asp Val Leu Thr Lys Tyr Ser Gin Pro Glu Asp Ser 
3795 3800 3805 

ttg att ccc ttt ttt gag ata ace gtg cct gaa tct cag tta act gtg 

Leu lie Pro Phe Phe Glu He Thr Val Pro Glu Ser Gin Leu Thr Val 
3810 3815 3820 

tec cag ttc acg ctt cca aaa agt gtt tea gat ggc att get get ttg 

Ser Gin Phe Thr Leu Pro Lys Ser Val Ser Asp Gly He Ala Ala Leu 

3825 3830 3835 

gat eta aat gca gta gec aac aag ate gca gac ttt gag ttg ccc acc 

Asp Leu Asn Ala Val Ala Asn Lys He Ala Asp Phe Glu Leu Pro Thr 

3840 3845 3850 

ate ate gtg cct gag cag acc att gag att ccc tec att aag ttc tct 

Tie Tie Val Pro Glu Gin Thr Tie Glu Tie Pro Ser He Lys Phe Ser 

3855 3860 3865 3870 

gta cot get gga att gtc att cct tec ttt caa gca ctg act gca cgc 

Val Pro Ala Gly He Val He Pro Ser Phe Gin Ala Leu Thr Ala Arg 
3875 3880 3885 

ttt gag gta gac tct ccc gtg tat aat gec act tgg agt gee agt ttg 

Phe Glu Val Asp Ser Pro Val Tyr Aen Ala Thr Trp Ser Ala Ser Leu 
3890 3895 3900 

aaa aac aaa gca gat tat gtt gaa aca gtc ctg gat tec aca tgc age 

Lys Asn Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser 

3905 3910 3915 

tea acc gta cag ttc eta gaa tat gaa eta aat gtt ttg gga aca cac 

Ser Thr Val Gin Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr His 

3920 3925 3930 

aaa ate gaa gat ggt acg tta gec tct aag act aaa gga aca ctt gca 

Lys lie Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr Leu Ala 

3935 3940 3945 3950 

cac cgt gac ttc agt gca gaa tat gaa gaa gat ggc aaa ttt gaa gga 

His Arg Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys Phe Glu Gly 
3955 3960 3965 

ctt cag gaa tgg gaa gga aaa gcg cac etc aat ate aaa age cca gcg 

Leu Gin Glu Trp Glu Gly Lys Ala His Leu Asn lie Lys Ser Pro Ala 
3970 3975 3980 

ttc acc gat etc cat ctg cgc tac cag aaa gac aag aaa ggc ate tec 

Phe Thr Asp Leu His Leu Arg Tyr Gin Lys Asp Lys Lys Gly lie Ser 

39S5 3990 3995 

acc tea gca gec tec cca gee gta ggc acc gtg ggc atg gat atg gat 

Thr Ser Ala Ala Ser Pro Ala Val Gly Thr Val Gly Met Asp Met Asp 

4000 4005 4010 

gaa gat gac gac ttt tct aaa tgg aac ttc tac tac age cct cag tec 

Glu Asp Asp Asp Phe Ser Lys Trp Asn Phe Tyr Tyr Ser Pro Gin Ser 

4015 4020 4025 4030 

tct cca gat aaa aaa etc acc ata ttc aaa act gag ttg agg gtc egg 

Ser Pro Asp Lys Lys Leu Thr He Phe Lys Thr Glu Leu Arg Val Arg 
4035 4040 4045 

gaa tct gat gag gaa act cag ate aaa gtt aat tgg gaa gaa gag gca 
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Glu Ser Asp Glu Glu Thr Gin lie Lys Val Asn Trp Glu Glu Glu Ala 

4050 4055 4060 

get tct ggc ttg eta acc tct ctg aaa gac aac gtg ccc aag gec aca 

Ala Ser Gly Leu Leu Thr Eer Leu Lys Asp Asn Val Pro Lys Ala Thr 
4065 4070 4075 

ggg gtc ctt tat gat tat gtc aac aag tac cac tgg gaa cac aca ggg 

Gly Val Leu Tyr Asp Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly 
4080 4085 4090 

etc acc ctg aga gaa gtg tct tea aag ctg aga aga aat ctg cag aac 

Leu Thr Leu Arg Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gin Asn 

4095 4100 4105 4110 

aat get gag tgg gtt tat caa ggg gec att agg caa att gat gat ate 

Asn Ala Glu Trp Val Tyr Gin Gly Ala lie Arg Gin lie Asp Asp lie 
4115 4120 4125 

gac gtg agg ttc cag aaa gca gec agt ggc acc act ggg acc tac caa 

Asp Val Arg Phe Gin Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gin 

4130 4135 4140 

gag tgg aag gac aag gec cag aat ctg tac cag gaa ctg ttg act cag 

Glu Trp Lys Asp Lys Ala Gin Asn Leu Tyr Gin Glu Leu Leu Thr Gin 
4145 4150 4155 

gaa ggc caa gec agt ttc cag gga etc aag gat aac gtg ttt gat ggc 

Glu Gly Gin Ala Ser Phe Gin Gly Leu Lys Asp Asn Val Phe Asp Gly 
4160 4165 4170 

ttg gta aga gtt act caa aaa ttc cat atg aaa gtc aag cat ctg att 

Leu Val Arg Val Thr Gin Lys Phe His Met Lys Val Lys His Leu He 

4175 4180 4185 4190 

gac tea etc att gat ttt ctg aac ttc ccc aga ttc cag ttt ccg ggg 

Asp Ser Leu lie Asp Phe Leu Asn Phe Pro Arg Phe Gin Phe Pro Gly 
4195 4200 4205 

aaa cct ggg ata tac act agg gag gaa ctt tgc act atg ttc ata agg 

Lys Pro Gly He Tyr Thr Arg Glu Glu Leu Cys Thr Met Phe He Arg 

4210 4215 4220 

gag gta ggg acg gta ctg tec cag gta tat teg aaa gtc cat aat ggt 

Glu Val Gly Thr Val Leu Ser Gin Val Tyr Ser Lys Val His Asn Gly 
4225 4230 4235 

tea gaa ata ctg ttt tec tat ttc caa gac eta gtg att aca ctt cct 

Ser Glu lie Leu Phe Ser Tyr Phe Gin Asp Leu Val He Thr Leu Pro 
4240 4245 4250 

ttc gag tta agg aaa cat aaa eta ata gat gta ate teg atg tat agg 

Phe Glu Leu Arg Lys Hie Lys Leu He Asp Val lie Ser Met Tyr Arg 

4255 4260 4265 4270 

gaa ctg ttg aaa gat tta tea aaa gaa gec caa gag gta ttt aaa gec 

Glu Leu Leu Lys Asp Leu Ser Lys Glu Ala Gin Glu Val Phe Lys Ala 
4275 4280 4285 

att cag tct etc aag acc aca gag gtg eta cgt aat ctt cag gac ctt 

lie Gin Ser Leu Lys Thr Thr Glu Val Leu Arg Asn Leu Gin Asp Leu 

4290 4295 4300 

tta caa ttc att ttc oaa eta ata gaa gat aac att aaa cag otg aaa 

Leu Gin Phe He Phe Gin Leu He Glu Asp Asn He Lys Gin Leu Lys 
4305 4310 4315 

gag atg aaa ttt act tat ctt att aat tat ate caa gat gag ate aac 

Glu Met Lys Phe Thr Tyr Leu He Asn Tyr He Gin Asp Glu He Asn 
4320 4325 4330 

aca ate ttc aat gat tat ate cca tat gtt ttt aaa ttg ttg aaa gaa 

Thr He Phe Asn Asp Tyr He Pro Tyr Val Phe Lys Leu Leu Lys Glu 

4335 4340 4345 4350 

aac eta tgc ctt aat ctt cat aag ttc aat gaa ttt att caa aac gag 
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Asn Leu Cys Leu Asn Leu His Lys Phe Asn Glu Phe lie Gin Asn Glu 
4355 4360 4365 

ctt Gag gaa get tct naa gag tta cag cag ate cat caa tac att atg 1327 4 

Leu Gin Glu Ala Ser Gin Glu Leu Gin Gin lie His Gin Tyr lie Met 
4370 4375 4380 

gec ctt cgt gaa gaa tat ttt gat cca agt ata gtt ggc tgg aca gtg 13 322 

Ala Leu Arg Glu Glu Tyr Phe Asp Pro Ser lie Val Gly Trp Thr Val 

4385 4390 4395 

aaa tat tat gaa ctt gaa gaa aag ata gtc agt ctg ate aag aac ctg 13 37 0 

Lys Tyr Tyr Glu Leu Glu Glu Lys lie Val Ser Leu lie Lys Asn Leu 

4400 4405 4410 

tta gtt get ctt aag gac ttc cat tct gaa tat att gtc agt gec tct 13418 

Leu Val Ala Leu Lys Asp Phe His Ser Glu Tyr lie Val Ser Ala Ser 
4415 4420 4425 4430 

aac ttt act tec caa etc tea agt caa gtt gag caa ttt ctg cac aga 13 46 6 

Asn Phe Thr Ser Gin Leu Ser Ser Gin Val Glu Gin Phe Leu His Arg 
4435 4440 4445 

aat att cag gaa tat ctt age ate ctt ace gat cca gat gga aaa ggg 13514 

Asn lie Gin Glu Tyr Leu Ser lie Leu Thr Asp Pro Asp Gly Lys Gly 
4450 4455 4460 

aaa gag aag att gca gag ctt tct gec act get cag gaa ata att aaa 13562 

Lys Glu Lys lie Ala Glu Leu Ser Ala Thr Ala Gin Glu Tie lie Lys 

4465 4470 4475 

age cag gec att gag aag aag aaa ata att tat gat tac cac cag cag 13610 

Ser Gin Ala lie Ala Thr Lys Lys He He Ser Asp Tyr His Gin Gin 

4480 4485 4490 

ttt aga tat aaa ctg caa gat ttt tea gac caa etc tct gat tac tat 13 65 8 

Phe Arg Tyr Lys Leu Gin Asp Phe Ser Asp Gin Leu Ser Asp Tyr Tyr 
4495 4500 4505 4510 

gaa aaa ttt att get gaa tec aaa aga ttg att gac ctg tec att caa 13 70 6 

Glu Lys Phe He Ala Glu Ser Lys Arg Leu He Asp Leu Ser He Gin 
4515 4520 4525 

aac tac cac aca ttt ctg ata tac ate acg gag tta ctg aaa aag ctg 13 75 4 

Asn Tyr His Thr Phe Leu lie Tyr lie Thr Glu Leu Leu Lys Lys Leu 
4530 4535 4540 

caa tea acc aca gtc atg aac ccc tac atg aag ctt get cca gga gaa 13 802 

Gin Ser Thr Thr Val Met Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu 

4545 4550 4555 

ctt act ate ate etc taa ttttttaaaa gaaatcttca tttattcttc 13 850 

Leu Thr lie He Leu * 
4560 

ttttccaatt gaactttcac atagcacaga aaaaattcaa actgectata ttgataaaac 13910 

catacagtga gccagccttg cagtaggcag tagactataa gcagaagcac atatgaactg 13 97 0 

gacctgcacc aaagctggca ccagggctcg gaaggtctct gaactcagaa ggatggcatt 14 03 0 

ttttgeaagt taaagaaaat caggatctga gttattttgc taaacttggg ggaggaggaa 14 09 0 
caaataaatg gagtctttat tgtgtatcat a 14 12 1 


<210> SEQ ID HO 32 

<211> LENGTH: 45 6 3 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<400> SEQUENCE: 32 


Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu Pro Ala 
15 10 15 
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Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu Met Leu 

20 25 30 

Glu Abu Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe Lye His 

35 40 45 

Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser Gly Val 

50 55 60 

Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg lie Aan Cys Lys Val 


Glu Leu Glu Val Pro Gin Leu Cys Ser Phe lie Leu Lys Thr Ser Gin 
85 90 95 

Cys Thr Leu Lys Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys Ala Leu 
100 105 110 

Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met Ser Arg 
115 120 125 

Tyr Glu Leu Lys Leu Ala lie Pro Glu Gly Lys Gin Val Phe Leu Tyr 
130 135 140 

Pro Glu Lys Asp Glu Pro Thr Tyr lie Leu Asn lie Lys Arg Gly lie 
145 150 155 160 

lie Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys Gin Val 

165 170 175 

Leu Phe Leu Asp Thr Val Tyr Gly Asn Cys Ser Thr His Phe Thr Val 

180 185 190 

Lys Thr Arg Lys Gly Asn Val Ala Thr Glu lie Ser Thr Glu Arg Asp 
195 200 205 

Leu Gly Gin Cys Asp Arg Phe Lys Pro lie Arg Thr Gly lie Ser Pro 
210 215 220 

Leu Ala Leu lie Lys Gly Met Thr Arg Pro Leu Ser Thr Leu He Ser 
225 230 235 240 

Ser Ser Gin Ser Cys Gin Tyr Thr Leu Asp Ala Lys Arg Lys His Val 
245 250 255 

Ala Glu Ala He Cys Lys Glu Gin His Leu Phe Leu Pro Phe Ser Tyr 
260 265 270 

Asn Asn Lys Tyr Gly Met Val Ala Gin Val Thr Gin Thr Leu Lye Leu 
275 280 285 

Glu Asp Thr Pro Lys He Asn Ser Arg Phe Phe Gly Glu Gly Thr Lys 
290 295 300 

Lys Met Gly Leu Ala Phe Glu Ser Thr Lys Ser Thr Ser Pro Pro Lys 
305 310 315 320 

Gin Ala Glu Ala Val Leu Lys Thr Leu Gin Glu Leu Lys Lys Leu Thr 

325 330 335 

lie Ser Glu Gin Asn lie Gin Arg Ala Asn Leu Phe Asn Lys Leu Val 
340 345 350 

Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu Leu Pro 
355 360 365 

Gin Leu lie Glu Val Ser Ser Pro He Thr Leu Gin Ala Leu Val Gin 
370 375 380 

Cys Gly Gin Pro Gin Cys Ser Thr His He Leu Gin Trp Leu Lys Arg 
385 390 395 400 

Val His Ala Asn Pro Leu Leu He Asp Val Val Thr Tyr Leu Val Ala 
405 410 415 

Leu He Pro Glu Pro Ser Ala Gin Gin Leu Arg Glu He Phe Asn Met 
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Ala Arg Asp Gin Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser His Ala 

435 440 445 

Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gin Glu Leu Leu 
450 455 460 

Asp lie Ala Asn Tyr Leu Met Glu Gin lie Gin Asp Asp Cys Thr Gly 
465 470 475 480 

Asp Glu Asp Tyr Thr Tyr Leu lie Leu Arg Val lie Gly Asn Met Gly 
485 490 495 

Gin Thr Met Glu Gin Leu Thr Pro Glu Leu Lys Ser Ser lie Leu Lys 
500 505 510 

Cys Val Gin Ser Thr Lys Pro Ser Leu Met lie Gin Lys Ala Ala lie 
515 520 525 

Gin Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gin Glu Val Leu 
530 535 540 

Leu Gin Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg Leu Ala 
545 550 555 560 

Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gin Ala Asp lie Asn Lys 
565 570 575 

He Val Gin He Leu Pro Trp Glu Gin Asn Glu Gin Val Lys Asn Phe 
580 585 590 

Val Ala Ser His He Ala Asn He Leu Asn Ser Glu Glu Leu Asp He 
595 600 605 

Gin Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser Gin Leu 
610 615 620 

Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gin Leu Tyr 
625 630 635 640 

Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys He Glu 
645 650 655 

Gly Asn Leu He Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu Ser Met 
660 S65 670 

Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp Leu lie 
675 680 685 

Glu He Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu Ala Leu 
690 695 700 

Phe Gly Lys Gin Gly Phe Phe Pro Asp Ser Val Asn Lys Ala Leu Tyr 
705 710 715 720 

Trp Val Asn Gly Gin Val Pro Asp Gly Val Ser Lys Val Leu Val Asp 
725 730 735 

His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gin Asp Met Val Asn 
740 745 750 

Gly He Met Leu Ser Val Glu Lys Leu He Lys Asp Leu Lys Ser Lys 
755 760 765 

Glu Val Pro Glu Ala Arg Ala Tyr Leu Arg He Leu Gly Glu Glu Leu 
770 775 780 

Gly Phe Ala Ser Leu His Asp Leu Gin Leu Leu Gly Lys Leu Leu Leu 
785 790 795 800 

Met Gly Ala Arg Thr Leu Gin Gly He Pro Gin Met He Gly Glu Val 
805 810 815 

He Arg Lys Gly ser Lys Asn Asp Phe Phe Leu His Tyr He Phe Met 

820 S25 830 
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Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gin Leu Gin lie 
835 840 845 

Ser Ser Eer Gly Val lie Ala Pro Gly Ala Lys Ala Gly Val Lys Leu 
850 855 860 

Glu Val Ala Asn Met Gin Ala Glu Leu Val Ala Lys Pro Ser Val Ser 
865 870 875 880 

Val Glu Phe Val Thr Asn Met Gly lie He He Pro Asp Phe Ala Arg 

885 890 895 

Ser Gly Val Gin Met Asn Thr Asn Phe Phe His Glu Ser Gly Leu Glu 
900 905 910 

Ala His Val Ala Leu Lys Ala Gly Lys Leu Lys Phe He He Pro Ser 
915 920 925 

Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu His Leu 
930 935 940 

Val Ser Thr Thr Lys Thr Glu Val lie Pro Pro Leu He Glu Asn Arg 
945 950 955 960 

Gin Ser Trp Ser Val Cys Lys Gin Val Phe Pro Gly Leu Asn Tyr Cys 

965 970 975 

Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala Ser Tyr 
980 985 990 

Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg Pro Thr 
995 1000 1005 

Gly Glu lie Glu Gin Tyr Ser Val Ser Ala Thr Tyr Glu Leu Gin Arg 
1010 1015 1020 

Glu Asp Arg Ala Leu Val Asp Thr Leu Lys Phe Val Thr Gin Ala Glu 
1025 1030 1035 1040 

Gly Ala Lys Gin Thr Glu Ala Thr Met Thr Phe Lys Tyr Asn Arg Gin 

1045 1050 1055 

Ser Met Thr Leu Ser Ser Glu Val Gin He Pro Asp Phe Asp Val Asp 
1060 1065 1070 

Leu Gly Thr He Leu Arg Val Asn Asp Glu Ser Thr Glu Gly Lys Thr 
1075 1080 1085 

Ser Tyr Arg Leu Thr Leu Asp He Gin Asn Lys Lys He Thr Glu Val 
1090 1095 1100 

Ala Leu Met Gly His Leu Ser Cys Asp Thr Lys Glu Glu Arg Lys He 
1105 1110 1115 1120 

Lys Gly Val He Ser He Pro Arg Leu Gin Ala Glu Ala Arg Ser Glu 

1125 1130 1135 

lie Leu Ala His Trp Ser Pro Ala Lys Leu Leu Leu Gin Met Asp Ser 
1140 1145 1150 

Ser Ala Thr Ala Tyr Gly Ser Thr Val Ser Lys Arg Val Ala Trp His 
1155 1160 1165 

Tyr Asp Glu Glu Lys lie Glu Phe Glu Trp Asn Thr Gly Thr Asn Val 
1170 1175 1180 

Asp Thr Lys Lys Met Thr Ser Asn Phe Pro Val Asp Leu Ser Asp Tyr 
1185 1190 1195 1200 

Pro Lys Ser Leu His Met Tyr Ala Asn Arg Leu Leu Asp His Arg Val 

1205 1210 1215 


Pro Glu Thr Asp Met Thr Phe Arg His Val Gly Ser Lys Leu lie Val 
1220 1225 1230 
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Ala Met Ser Ser Trp Leu Gin Lys Ala Ser Gly Ser Leu Pro Tyr Thr 
1235 1240 1245 

Gin Thr Leu Gin Asp His Leu Asn Ser Leu Lys Glu Phe Asn Leu Gin 
1250 1255 1260 

Asn Met Gly Leu Pro Asp Phe His lie Pro Glu Asn Leu Phe Leu Lys 

1265 1270 1275 1280 

Ser Asp Gly Arg Val Lye Tyr Thr Leu Asn Lys Asn Ser Leu Lys lie 
1285 1290 1295 

Glu lie Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg Asp Leu Lys Met 
1300 1305 1310 

Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe Lys Ser Val Gly Phe 
1315 1320 1325 

His Leu Pro Ser Arg Glu Phe Gin Val Pro Thr Phe Thr lie Pro Lys 
1330 1335 1340 

Leu Tyr Gin Leu Gin Val Pro Leu Leu Gly Val Leu Asp Leu Ser Thr 

1345 1350 1355 1360 

Asn Val Tyr Ser Asn Leu Tyr Asn Trp Ser Ala Ser Tyr Ser Gly Gly 
1365 1370 1375 

Asn Thr Ser Thr Asp His Phe Ser Leu Arg Ala Arg Tyr His Met Lys 

13S0 1385 1390 

Ala Asp Ser Val Val Asp Leu Leu Ser Tyr Asn Val Gin Gly Ser Gly 
1395 1400 1405 

Glu Thr Thr Tyr Asp His Lys Asn Thr Phe Thr Leu Ser Cys Asp Gly 
1410 1415 1420 

Ser Leu Arg His Lys Phe Leu Asp Ser Asn lie Lys Phe Ser His Val 

1425 1430 1435 1440 

Glu Lys Leu Gly Asn Asn Pro Val Ser Lys Gly Leu Leu lie Phe Asp 
1445 1450 1455 

Ala Ser Ser Ser Trp Gly Pro Gin Met Ser Ala Ser Val His Leu Asp 
1460 1465 1470 

Ser Lys Lys Lys Gin His Leu Phe Val Lys Glu Val Lys He Asp Gly 
1475 1480 1485 

Gin Phe Arg Val Ser Ser Phe Tyr Ala Lys Gly Thr Tyr Gly Leu Ser 
1490 1495 1500 

Cys Gin Arg Asp Pro Asn Thr Gly Arg Leu Asn Gly Glu Ser Asn Leu 

1505 1510 1515 1520 

Arg Phe Asn Ser Ser Tyr Leu Gin Gly Thr Asn Gin He Thr Gly Arg 
1525 1530 1535 

Tyr Glu Asp Gly Thr Leu Ser Leu Thr Ser Thr Ser Asp Leu Gin Ser 
1540 1545 1550 

Gly He lie Lys Asn Thr Ala Ser Leu Lys Tyr Glu Asn Tyr Glu Leu 
1555 1560 1565 

Thr Leu Lys Ser Asp Thr Asn Gly Lys Tyr Lys Asn Phe Ala Thr Ser 
1570 1575 1580 

Asn Lys Met Asp Met Thr Phe Ser Lys Gin Asn Ala Leu Leu Arg Ser 

1585 1590 1595 1600 

Glu Tyr Gin Ala Asp Tyr Glu Ser Leu Arg Phe Phe Ser Leu Leu Ser 

1605 1610 1615 

Gly Ser Leu Asn Ser His Gly Leu Glu Leu Asn Ala Asp He Leu Gly 
1620 1625 1630 

Thr Asp Lys He Asn Ser Gly Ala His Lys Ala Thr Leu Arg He Gly 
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Gin Asp Gly lie Ser Thr Ser Ala Thr Thr Asn Leu Lys Cys Ser Leu 
1650 1655 1660 

Leu Val Leu Glu Asn Glu Leu Asn Ala Glu Leu Gly Leu Ser Gly Ala 
1665 1670 1675 1680 

Ser Met Lys Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn Ala Lys 

1685 1690 1695 

Phe Ser Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu Gly Ser 
1700 1705 1710 

Ala Tyr Gin Ala Met lie Leu Gly Val Asp Ser Lys Asn He Phe Asn 
1715 1720 1725 

Phe Lye Val Ser Gin Glu Gly Leu Lys Leu Ser Asn Asp Met Met Gly 
1730 1735 1740 

Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn He Ala 
1745 1750 1755 1760 

Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn He Tyr Ser Ser 

1765 1770 1775 

Asp Lys Phe Tyr Lys Gin Thr Val Asn Leu Gin Leu Gin Pro Tyr Ser 
1780 1785 1790 

Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr Asn Ala Leu Asp Leu 
1795 1800 1805 

Thr Asn Asn Gly Lys Leu Arg Leu Glu Pro Leu Lys Leu His Val Ala 
1810 1815 1820 

Gly Asn Leu Lys Gly Ala Tyr Gin Asn Asn Glu He Lys His He Tyr 
1825 1830 1835 1840 

Ala He Ser Ser Ala Ala Leu Ser Ala Ser Tyr Lys Ala Asp Thr Val 

1845 1850 1855 

Ala Lys Val Gin Gly Val Glu Phe Ser His Arg Leu Asn Thr Asp lie 
1860 1865 1870 

Ala Gly Leu Ala Ser Ala lie Asp Met Ser Thr Asn Tyr Asn Ser Asp 
1875 1880 1885 

Ser Leu His Phe Ser Asn Val Phe Arg Ser Val Met Ala Pro Phe Thr 
1890 1895 1900 

Met Thr lie Asp Ala His Thr Asn Gly Asn Gly Lys Leu Ala Leu Trp 
1905 1910 1915 1920 

Gly Glu His Thr Gly Gin Leu Tyr Ser Lys Phe Leu Leu Lys Ala Glu 

1925 1930 1935 

Pro Leu Ala Phe Thr Phe Ser His Asp Tyr Lys Gly Ser Thr Ser His 
1940 1945 1950 

His Leu Val Ser Arg Lys Ser He Ser Ala Ala Leu Glu His Lys Val 
1955 1960 1965 

Ser Ala Leu Leu Thr Pro Ala Glu Gin Thr Gly Thr Trp Lys Leu Lys 
1970 1975 1980 

Thr Gin Phe Asn Asn Asn Glu Tyr Ser Gin Asp Leu Asp Ala Tyr Asn 
1985 1990 1995 2000 

Thr Lys Asp Lys He Gly Val Glu Leu Thr Gly Arg Thr Leu Ala Asp 

2005 2010 2015 

Leu Thr Leu Leu Asp Ser Pro He Lys Val Pro Leu Leu Leu Ser Glu 
2020 2025 2030 

Pro He Asn He He Asp Ala Leu Glu Met Arg Asp Ala Val Glu Lys 
2035 2040 2045 
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Pro Gin Glu Phe Thr lie Val Ala Phe Val Lys Tyr Asp Lys Asn Gin 
2050 2055 2060 

Asp Val His Ser lie Asn Leu Pro Phe Phe Glu Thr Leu Gin Glu Tyr 

2065 2070 2075 2080 

Phe Glu Arg Asn Arg Gin Thr He He Val Val Val Glu Asn Val Gin 
2085 2090 2095 

Arg Asn Leu Lys His lie Asn He Asp Gin Phe Val Arg Lys Tyr Arg 
2100 2105 2110 

Ala Ala Leu Gly Lys Leu Pro Gin Gin Ala Asn Asp Tyr Leu Asn Ser 
2115 2120 2125 

Phe Asn Trp Glu Arg Gin Val Ser His Ala Lys Glu Lys Leu Thr Ala 
2130 2135 2140 

Leu Thr Lys Lys Tyr Arg lie Thr Glu Asn Asp lie Gin He Ala Leu 

2145 2150 2155 2160 

Asp Asp Ala Lys lie Asn Phe Asn Glu Lys Leu Ser Gin Leu Gin Thr 
2165 2170 2175 

Tyr Met lie Gin Phe Asp Gin Tyr He Lys Asp Ser Tyr Asp Leu His 
2180 2185 2190 

Asp Leu Lys He Ala lie Ala Asn He Tie Asp Glu He He Glu Lys 


Leu Lys Ser Leu Asp Glu His Tyr His He Arg Val Asn Leu Val Lys 

2210 2215 2220 

Thr He His Asp Leu His Leu Phe He Glu Asn lie Asp Phe Asn Lys 
2225 2230 2235 2240 

Ser Gly Ser Ser Thr Ala Ser Trp He Gin Asn Val Asp Thr Lys Tyr 
2245 2250 2255 

Gin He Arg He Gin He Gin Glu Lys Leu Gin Gin Leu Lys Arg His 
2260 2265 2270 

lie Gin Asn He Asp lie Gin His Leu Ala Gly Lys Leu Lys Gin His 
2275 2280 2285 

lie Glu Ala He Asp Val Arg Val Leu Leu Asp Gin Leu Gly Thr Thr 

2290 2295 2300 

lie Ser Phe Glu Arg lie Asn Asp Val Leu Glu His Val Lys His Phe 
2305 2310 2315 2320 

Val He Asn Leu lie Gly Asp Phe Glu Val Ala Glu Lys He Asn Ala 
2325 2330 2335 

Phe Arg Ala Lys Val His Glu Leu He Glu Arg Tyr Glu Val Asp Gin 
2340 2345 2350 

Gin He Gin Val Leu Met Asp Lys Leu Val Glu Leu Thr His Gin Tyr 
2355 2360 2365 

Lys Leu Lys Glu Thr lie Gin Lys Leu Ser Asn Val Leu Gin Gin Val 

2370 2375 2380 

Lys He Lys Asp Tyr Phe Glu Lys Leu Val Gly Phe lie Asp Asp Ala 
2385 2390 2395 2400 

Val Lys Lys Leu Asn Glu Leu Ser Phe Lys Thr Phe He Glu Asp Val 
2405 2410 2415 

Asn Lys Phe Leu Asp Met Leu Tie Lys Lys Leu Lys Ser Phe Asp Tyr 
2420 2425 2430 

His Gin Phe Val Asp Glu Thr Asn Asp Lys He Arg Glu Val Thr Gin 
2435 2440 2445 


US 2003/0224418 Al Dec. 4, 2003 

82 


-continued 

Arg Leu Asn Gly Glu lie Gin Ala Leu Glu Leu Pro Gin Lys Ala Glu 
2450 2455 2460 

Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala Val Tyr 

2465 2470 2475 2480 

Leu Glu Ser Leu Gin Asp Thr Lys lie Thr Leu lie lie Asn Trp Leu 
2485 2490 2495 

Gin Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met Lys Ala Lys Phe 
2500 2505 2510 

Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met Tyr Gin Met Asp lie 
2515 2520 2525 

Gin Gin Glu Leu Gin Arg Tyr Leu Ser Leu Val Gly Gin Val Tyr Ser 
2530 2535 2540 

Thr Leu Val Thr Tyr lie Ser Asp Trp Trp Thr Leu Ala Ala Lys Asn 

2545 2550 2555 2560 

Leu Thr Asp Phe Ala Glu Gin Tyr Ser lie Gin Asp Trp Ala Lys Arg 
2565 2570 2575 

Met Lys Ala Leu Val Glu Gin Gly Phe Thr Val Pro Glu lie Lys Thr 
2580 2585 2590 

lie Leu Gly Thr Met Pro Ala Phe Glu Val Ser Leu Gin Ala Leu Gin 
2595 2600 2605 

Lys Ala Thr Phe Gin Thr Pro Asp Phe lie Val Pro Leu Thr Asp Leu 
2610 2615 2620 

Arg He Pro Ser Val Gin He Asn Phe Lys Asp Leu Lys Asn lie Lys 

2625 2630 2635 2640 

lie Pro Ser Arg Phe Ser Thr Pro Glu Phe Thr lie Leu Asn Thr Phe 
2645 2650 2655 

His lie Pro Ser Phe Thr lie Asp Phe Val Glu Met Lys Val Lys He 
2660 2665 2670 

lie Arg Thr He Asp Gin Met Gin Asn Ser Glu Leu Gin Trp Pro Val 
2675 2680 2685 

Pro Asp lie Tyr Leu Arg Asp Leu Lys Val Glu Asp lie Pro Leu Ala 
2690 2695 2700 

Arg He Thr Leu Pro Asp Phe Arg Leu Pro Glu lie Ala He Pro Glu 

2705 2710 2715 2720 

Phe He lie Pro Thr Leu Asn Leu Asn Asp Phe Gin Val Pro Asp Leu 
2725 2730 2735 

His He Pro Glu Phe Gin Leu Pro His He Ser His Thr He Glu Val 
2740 2745 2750 

Pro Thr Phe Gly Lys Leu Tyr Ser He Leu Lys lie Gin Ser Pro Leu 
2755 2760 2765 

Phe Thr Leu Asp Ala Asn Ala Asp He Gly Asn Gly Thr Thr Ser Ala 
2770 2775 2780 

Asn Glu Ala Gly He Ala Ala Ser He Thr Ala Lys Gly Glu Ser Lys 

2785 2790 2795 2800 

Leu Glu Val Leu Asn Phe Asp Phe Gin Ala Asn Ala Gin Leu Ser Asn 
2805 2810 2815 

Pro Lys He Asn Pro Leu Ala Leu Lys Glu Ser Val Lys Phe Ser Ser 

2820 2825 2830 

Lys Tyr Leu Arg Thr Glu His Gly Ser Glu Met Leu Phe Phe Gly Asn 
2835 2840 2845 


Ala He Glu Gly Lys Ser Asn Thr Val Ala Ser Leu His Thr Glu Lys 
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Asn Thr Leu Glu Leu Ser Asn Gly Val lie Val Lys lie Asn Asn Gin 

2865 2870 2875 2880 

Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn He Pro 
2885 2890 2895 

Lys Leu Asp Phe Ser Ser Gin Ala Asp Leu Arg Asn Glu He Lys Thr 
2900 2905 2910 

Leu Leu Lys Ala Gly His lie Ala Trp Thr Ser Ser Gly Lys Gly Ser 
2915 2920 2925 

Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His Glu Ser 
2930 2935 2940 

Gin lie Ser Phe Thr lie Glu Gly Pro Leu Thr Ser Phe Gly Leu Ser 

2945 2950 2955 2960 

Asn Lys lie Asn Ser Lys His Leu Arg Val Asn Gin Asn Leu Val Tyr 
2965 2970 2975 

Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu lie Gin Ser Gin Val 
2980 2985 2990 

Asp Ser Gin His Val Gly His Ser Val Leu Thr Ala Lys Gly Met Ala 
2995 3000 3005 

Leu Phe Gly Glu Gly Lye Ala Glu Phe Thr Gly Arg His Asp Ala His 
3010 3015 3020 

Leu Asn Gly Lys Val He Gly Thr Leu Lys Asn Ser Leu Phe Phe Ser 

3025 3030 3035 3040 

Ala Gin Pro Phe Glu He Thr Ala Ser Thr Asn Asn Glu Gly Asn Leu 
3045 3050 3055 

Lys Val Arg Phe Pro Leu Arg Leu Thr Gly Lys He Asp Phe Leu Asn 
3060 3065 3070 

Asn Tyr Ala Leu Phe Leu Ser Pro Ser Ala Gin Gin Ala Ser Trp Gin 
3075 3080 3085 

Val Ser Ala Arg Phe Asn Gin Tyr Lys Tyr Asn Gin Asn Phe Ser Ala 
3090 3095 3100 

Gly Asn Asn Glu Asn lie Met Glu Ala His Val Gly lie Asn Gly Glu 

3105 3110 3115 3120 

Ala Asn Leu Asp Phe Leu Asn lie Pro Leu Thr lie Pro Glu Met Arg 
3125 3130 3135 

Leu Pro Tyr Thr He He Thr Thr Pro Pro Leu Lys Asp Phe Ser Leu 
3140 3145 3150 

Trp Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys Gin Ser 
3155 3160 3165 

Phe Asp Leu Ser Val Lys Ala Gin Tyr Lys Lys Asn Lys His Arg His 
3170 3175 3180 

Ser He Thr Asn Pro Leu Ala Val Leu Cys Glu Phe He Ser Gin Ser 

3185 3190 3195 3200 

lie Lys Ser Phe Asp Arg His Phe Glu Lys Asn Arg Asn Asn Ala Leu 
3205 3210 3215 

Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys He Lys Phe Asp Lys 
3220 3225 3230 

Tyr Lys Ala Glu Lys Ser His Asp Glu Leu Pro Arg Thr Phe Gin He 
3235 3240 3245 

Pro Gly Tyr Thr Val Pro Val val Asn Val Glu val ser Pro Phe Thr 
3250 3255 3260 
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He Glu Met Ser Ala Phe Gly Tyr Val Phe Pro Lys Ala Val Ser Met 
3265 3270 3275 3280 

Pro Ser Phe Ser He Leu Gly Ser Asp Val Arg Val Pro Ser Tyr Thr 
3285 3290 3295 

Leu He Leu Pro Ser Leu Glu Leu Pro Val Leu His Val Pro Arg Asn 
3300 3305 3310 

Leu Lys Leu Ser Leu Pro His Phe Lys Glu Leu Cys Thr He Ser His 
3315 3320 3325 

He Phe He Pro Ala Met Gly Asn He Thr Tyr Asp Phe Ser Phe Lys 

3330 3335 3340 

Ser Ser Val He Thr Leu Asn Thr Asn Ala Glu Leu Phe Asn Gin Ser 
3345 3350 3355 3360 

Asp He Val Ala His Leu Leu Ser Ser Ser Ser Ser Val He Asp Ala 
3365 3370 3375 

Leu Gin Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys Arg Gly 
33B0 3385 3390 

Leu Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val Glu Gly 
3395 3400 3405 

Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu Val Ser 

3410 3415 3420 

Val Ala Lys Thr Thr Lys Ala Glu lie Pro He Leu Arg Met Asn Phe 
3425 3430 3435 3440 

Lys Gin Glu Leu Asn Gly Asn Thr Lys Ser Lys Pro Thr Val Ser Ser 
3445 3450 3455 

Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met Leu Tyr Ser Thr 
34S0 3465 3470 

Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu Glu Ser Leu Thr Ser 
3475 3480 3485 

Tyr Phe Ser He Glu Ser Ser Thr Lys Gly Asp Val Lys Gly Ser Val 

3490 3495 3500 

Leu Ser Arg Glu Tyr Ser Gly Thr He Ala Ser Glu Ala Asn Thr Tyr 
3505 3510 3515 3520 

Leu Asn Ser Lys Ser Thr Arg Ser Ser Val Lys Leu Gin Gly Thr Ser 
3525 3530 3535 

Lys He Asp Asp He Trp Asn Leu Glu Val Lys Glu Asn Phe Ala Gly 
3540 3545 3550 

Glu Ala Thr Leu Gin Arg He Tyr Ser Leu Trp Glu His Ser Thr Lys 
3555 3560 3565 

Asn His Leu Gin Leu Glu Gly Leu Phe Phe Thr Asn Gly Glu His Thr 

3570 3575 3580 

Ser Lys Ala Thr Leu Glu Leu Ser Pro Trp Gin Met Ser Ala Leu Val 
3585 3590 3595 3600 

Gin Val His Ala Ser Gin Pro Ser Ser Phe His Asp Phe Pro Asp Leu 
3605 3610 3615 

Gly Gin Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gin Lys He Arg 
3620 3625 3630 

Trp Lys Asn Glu Val Arg lie His Ser Gly Ser Phe Gin Ser Gin Val 
3635 3640 3645 


Glu Leu Ser Asn Asp Gin Glu Lys Ala His Leu Asp lie Ala Gly Ser 
3S50 3655 3660 
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Leu Glu Gly His Leu Arg Phe Leu Lys Asn lie lie Leu Pro Val Tyr 
3665 3670 3675 3680 

Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr Thr Ser lie 
3685 3690 3695 

Gly Arg Arg Gin His Leu Arg Val Ser Thr Ala Phe Val Tyr Thr Lys 
3700 3705 3710 

Asn Pro Asn Gly Tyr Ser Phe Ser lie Pro Val Lys Val Leu Ala Asp 
3715 3720 3725 

Lys Phe lie Thr Pro Gly Leu Lys Leu Asn Asp Leu Asn Ser Val Leu 

3730 3735 3740 

Val Met Pro Thr Phe His Val Pro Phe Thr Asp Leu Gin Val Pro Ser 
3745 3750 3755 3760 

Cys Lys Leu Asp Phe Arg Glu lie Gin lie Tyr Lys Lys Leu Arg Thr 
3765 3770 3775 

Ser Ser Phe Ala Leu Asn Leu Pro Thr Leu Pro Glu Val Lys Phe Pro 
3780 3785 3790 

Glu Val Asp Val Leu Thr Lys Tyr Ser Gin Pro Glu Asp Ser Leu lie 
3795 3800 3805 

Pro Phe Phe Glu lie Thr Val Pro Glu Ser Gin Leu Thr Val Ser Gin 

3810 3815 3820 

Phe Thr Leu Pro Lys Ser Val Ser Asp Gly lie Ala Ala Leu Asp Leu 
3825 3830 3835 3840 

Asn Ala Val Ala Asn Lys lie Ala Asp Phe Glu Leu Pro Thr lie lie 
3845 3850 3855 

Val Pro Glu Gin Thr lie Glu lie Pro Ser lie Lys Phe Ser Val Pro 
3860 3865 3870 

Ala Gly lie Val lie Pro Ser Phe Gin Ala Leu Thr Ala Arg Phe Glu 
3875 3880 3885 

Val Asp Ser Pro Val Tyr Asn Ala Thr Trp Ser Ala Ser Leu Lys Asn 

3890 3895 3900 

Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser Ser Thr 
3905 3910 3915 3920 

Val Gin Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr His Lye lie 
3925 3930 3935 

Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr Leu Ala His Arg 
3940 3945 3950 

Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys Phe Glu Gly Leu Gin 
3955 3960 3965 

Glu Trp Glu Gly Lys Ala His Leu Asn lie Lys Ser Pro Ala Phe Thr 

3970 3975 3980 

Asp Leu His Leu Arg Tyr Gin Lys Asp Lys Lys Gly lie Ser Thr Ser 
3985 3990 3995 4000 

Ala Ala Ser Pro Ala Val Gly Thr Val Gly Met Asp Met Asp Glu Asp 
4005 4010 4015 

Asp Asp Phe Ser Lys Trp Asn Phe Tyr Tyr Ser Pro Gin Ser Ser Pro 
4020 4025 4030 

Asp Lys Lys Leu Thr lie Phe Lys Thr Glu Leu Arg Val Arg Glu Ser 
4035 4040 4045 

Asp Glu Glu Thr Gin lie Lys Val Asn Trp Glu Glu Glu Ala Ala Ser 

4050 4055 4060 


Gly Leu Leu Thr Ser Leu Lys Asp Asn Val Pro Lys Ala Thr Gly Val 
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Leu Tyr Asp Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly Leu Thr 
4085 4090 4095 

Leu Arg Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gin Asn Asn Ala 
4100 4105 4110 

Glu Trp Val Tyr Gin Gly Ala lie Arg Gin lie Asp Asp lie Asp Val 
4115 4120 4125 

Arg Phe Gin Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gin Glu Trp 
4130 4135 4140 

Lys Asp Lys Ala Gin Asn Leu Tyr Gin Glu Leu Leu Thr Gin Glu Gly 

4145 4150 4155 4160 

Gin Ala Ser Phe Gin Gly Leu Lys Asp Asn Val Phe Asp Gly Leu Val 
4165 4170 4175 

Arg Val Thr Gin Lys Phe His Met Lys Val Lys His Leu He Asp Ser 
4180 4185 4190 

Leu He Asp Phe Leu Asn Phe Pro Arg Phe Gin Phe Pro Gly Lys Pro 
4195 4200 4205 

Gly lie Tyr Thr Arg Glu Glu Leu Cys Thr Met Phe He Arg Glu Val 
4210 4215 4220 

Gly Thr Val Leu Ser Gin Val Tyr Ser Lye Val His Asn Gly Ser Glu 

4225 4230 4235 4240 

lie Leu Phe Ser Tyr Phe Gin Asp Leu Val lie Thr Leu Pro Phe Glu 
4245 4250 4255 

Leu Arg Lys His Lys Leu He Asp Val He Ser Met Tyr Arg Glu Leu 
4260 4265 4270 

Leu Lys Asp Leu Ser Lys Glu Ala Gin Glu Val Phe Lys Ala He Gin 
4275 4280 4285 

Ser Leu Lys Thr Thr Glu Val Leu Arg Asn Leu Gin Asp Leu Leu Gin 
4290 4295 4300 

Phe He Phe Gin Leu lie Glu Asp Asn lie Lys Gin Leu Lys Glu Met 

4305 4310 4315 4320 

Lys Phe Thr Tyr Leu He Asn Tyr lie Gin Asp Glu He Asn Thr lie 
4325 4330 4335 

Phe Asn Asp Tyr lie Pro Tyr Val Phe Lys Leu Leu Lys Glu Asn Leu 
4340 4345 4350 

Cys Leu Asn Leu His Lys Phe Asn Glu Phe He Gin Asn Glu Leu Gin 
4355 4360 4365 

Glu Ala Ser Gin Glu Leu Gin Gin He His Gin Tyr He Met Ala Leu 
4370 4375 4380 

Arg Glu Glu Tyr Phe Asp Pro Ser He Val Gly Trp Thr Val Lys Tyr 

43S5 4390 4395 4400 

Tyr Glu Leu Glu Glu Lys He Val Ser Leu He Lys Asn Leu Leu Val 
4405 4410 4415 

Ala Leu Lys Asp Phe His Ser Glu Tyr He Val Ser Ala Ser Asn Phe 
4420 4425 4430 

Thr Ser Gin Leu Ser Ser Gin Val Glu Gin Phe Leu His Arg Asn He 
4435 4440 4445 

Gin Glu Tyr Leu Ser He Leu Thr Asp Pro Asp Gly Lys Gly Lys Glu 
4450 4455 4460 

Lys He Ala Glu Leu ser Ala Thr Ala Gin Glu He He Lys ser Gin 

4465 4470 4475 4480 
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Ala lie Ala Thr Lys Lys lie lie Ser Asp Tyr His Gin Gin Phe Arg 
4485 4490 4495 

Tyr Lys Leu Gin Asp Phe Eer Asp Gin Leu Ser Asp Tyr Tyr Glu Lys 
4500 4505 4510 

Phe lie Ala Glu Ser Lye Arg Leu He Asp Leu Ser He Gin Asn Tyr 
4515 4520 4525 

His Thr Phe Leu lie Tyr lie Thr Glu Leu Leu Lys Lys Leu Gin Ser 
4530 4535 4540 

Thr Thr Val Met Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu Leu Thr 
4545 4550 4555 4560 

He lie Leu 


<210> SEQ ID tJO 3 3 

<211> LENGTH: 219 6 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY : CDS 

<2 22> LOCATION: (13)... (1983) 

<223> OTHER IH FORMAT I OH : nucleotide sequence encoding 
5, 10-methylenetetrahydof olate reductase (MTHFR) 

<4 0 0> SEQUENCE: 3 3 

aattccggag cc atg gtg aac gaa gee aga gga aac age age etc aac ccc 5 1 

Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro 

1 5 10 

tgc ttg gag ggc agt gec age agt ggc agt gag age tec aaa gat agt 9 9 

Cys Leu Glu Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser Lys Asp Ser 
15 20 25 

teg aga tgt tec ace ccg ggc ctg gac cct gag egg cat gag aga etc 14 7 

Ser Arg Cys Ser Thr Pro Gly Leu Asp Pro Glu Arg His Glu Arg Leu 
30 35 40 45 

egg gag aag atg agg egg cga ttg gaa tct ggt gac aag tgg ttc tec 195 
Arg Glu Lys Met Arg Arg Arg Leu Glu Ser Gly Asp Lys Trp Phe Ser 
50 55 60 

ctg gaa ttc ttc cct cct cga act get gag gga get gtc aat etc ate 24 3 

Leu Glu Phe Phe Pro Pro Arg Thr Ala Glu Gly Ala Val Asn Leu He 
65 70 75 

tea agg ttt gac egg atg gca gca ggt ggc ccc etc tac ata gac gtg 29 1 

Ser Arg Phe Asp Arg Met Ala Ala Gly Gly Pro Leu Tyr He Asp Val 
80 85 90 

acc tgg cac cca gca ggt gac cct ggc tea gac aag gag acc tec tec 33 9 

Thr Trp His Pro Ala Gly Asp Pro Gly Ser Asp Lys Glu Thr Ser Ser 
95 100 105 

atg atg ate gec age acc gec gtg aac tac tgt ggc ctg gag acc ate 38 7 

Met Met lie Ala Ser Thr Ala Val Asn Tyr Cys Gly Leu Glu Thr He 
110 115 120 125 

ctg cac atg acc tgc tgc cgt cag cgc ctg gag gag ate acg ggc cat 435 
Leu His Met Thr Cys Cys Arg Gin Arg Leu Glu Glu He Thr Gly His 
130 135 140 

ctg cac aaa get aag cag ctg ggc ctg aag aac ate atg gcg ctg egg 48 3 

Leu His Lys Ala Lys Gin Leu Gly Leu Lys Asn He Met Ala Leu Arg 
145 150 155 

gga gac cca ata ggt gac cag tgg gaa gag gag gag gga ggc ttc aac 53 1 

Gly Asp Pro He Gly Asp Gin Trp Glu Glu Glu Glu Gly Gly Phe Asn 
160 165 170 


tac gca gtg gac ctg gtg aag cac ate cga agt gag ttt ggt gac tac 


579 
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Tyr Ala Val Asp Leu Val Lys His lie Arg Ser Glu Phe Gly Asp Tyr 

175 180 185 

ttt gac ate tgt gtg gca ggt tac ccc aaa ggc cac ccc gaa gca ggg 

Phe Asp lie Cys Val Ala Gly Tyr Pro Lys Gly His Pro Glu Ala Gly 

190 195 200 205 

age ttt gag get gac ctg aag cac ttg aag gag aag gtg tct gcg gga 

Ser Phe Glu Ala Asp Leu Lys His Leu Lys Glu Lys Val Ser Ala Gly 

210 215 220 

gee gat ttc ate ate acg cag ctt ttc ttt gag get gac aca ttc ttc 

Ala Asp Phe lie lie Thr Gin. Leu Phe Phe Glu Ala Asp Thr Phe Phe 

225 230 235 

cgc ttt gtg aag gca tgc ace gac atg ggc ate act tgc ggg ate gtc 

Arg Phe Val Lys Ala Cys Thr Asp Met Gly lie Thr Cys Pro lie Val 

240 245 250 

ccc ggg ate ttt ccc ate cag ggc tac cac tec ctt egg cag ctt gtg 

Pro Gly lie Phe Pro lie Gin. Gly Tyr His Ser Leu Arg Gin Leu Val 

255 260 265 

aag ctg tec aag ctg gag gtg cca cag gag ate aag gac gtg att gag 

Lys Leu Ser Lys Leu Glu Val Pro Gin Glu lie Lys Asp Val lie Glu 

270 275 280 285 

cca ate aaa gac aac gat get gec ate cgc aac tat ggc ate gag ctg 

Pro He Lys Asp Asn Asp Ala Ala Tie Arg Asn Tyr Gly Tie Glu Leu 

290 295 300 

gee gtg age ctg tgc cag gag ctt atg gec agt ggc ttg gtg cca ggc 

Ala Val Ser Leu Cys Gin Glu Leu Leu Ala Ser Gly Leu Val Pro Gly 

305 310 315 

etc cac ttc tac acc etc aac cgc gag atg get acc aca gag gtg ctg 

Leu His Phe Tyr Thr Leu Asn Arg Glu Met Ala Thr Thr Glu Val Leu 

320 325 330 

aag cgc ctg ggg atg tgg act gag gac ccc agg cgt ccc eta ccc tgg 

Lys Arg Leu Gly Met Trp Thr Glu Asp Pro Arg Arg Pro Leu Pro Trp 

335 340 345 

get etc agt gee cac ccc aag cgc cga gag gaa gat gta cgt ccc ate 

Ala Leu Ser Ala His Pro Lys Arg Arg Glu Glu Asp Val Arg Pro He 

350 355 3S0 365 

ttc tgg gee tec aga cca aag agt tac ate tac cgt acc eag gag tgg 

Phe Trp Ala Ser Arg Pro Lys Ser Tyr He Tyr Arg Thr Gin Glu Trp 

370 375 380 

gac gag ttc cct aac ggc cgc tgg ggc aat tec tct tec cct gec ttt 

Asp Glu Phe Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala Phe 

385 390 395 

ggg gag ctg aag gac tac tac etc ttc tac ctg aag age aag tec ccc 

Gly Glu Leu Lys Asp Tyr Tyr Leu Phe Tyr Leu Lys Ser Lys Ser Pro 

400 405 410 

aag gag gag ctg ctg aag atg tgg ggg gag gag ctg acc agt gaa gca 

Lys Glu Glu Leu Leu Lys Met Trp Gly Glu Glu Leu Thr Ser Glu Ala 

415 420 425 

agt gtc ttt gaa gtc ttt gtt ctt tae etc teg gga gaa cca aac egg 

Ser Val Phe Glu Val Phe Val Leu Tyr Leu Ser Gly Glu Pro Asn Arg 

430 435 440 445 

aat ggt cac aaa gtg act tgc ctg ggg tgg aac gat gag ggg ctg gcg 

Asn Gly His Lys Val Thr Cys Leu Pro Trp Asn Asp Glu Pro Leu Ala 

450 455 460 

get gag acc age ctg ctg aag gag gag ctg ctg egg gtg aac cgc cag 

Ala Glu Thr Ser Leu Leu Lys Glu Glu Leu Leu Arg Val Asn Arg Gin 

465 470 475 

ggc ate etc acc ate aac tea cag ccc aac ate aac ggg aag ccg tec 
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Gly lie Leu Thr lie Asn Ser Gin Pro Asn lie Asn Gly Lys Pro Ser 

480 485 490 

tec gac cc c ate gtg ggc tgg ggc c-cc- age ggg ggc tat gtc ttc cag 153 9 

Ser Asp Pro lie Val Gly Trp Gly Pro Ser Gly Gly Tyr Val Phe Gin 
495 500 505 

aag gec tac tta gag ttt ttc act tec cgc gag aca gcg gaa gca ctt 158 7 

Lys Ala Tyr Leu Glu Phe Phe Thr Ser Arg Glu Thr Ala Glu Ala Leu 
510 515 520 525 

ctg caa gtg ctg aag aag tac gag etc egg gtt aat tac cac ctt gtc 1635 
Leu Gin Val Leu Lys Lys Tyr Glu Leu Arg Val Asn Tyr His Leu Val 
530 535 540 

aat gtg aag ggt gaa aac ate ace aat gec cct gaa ctg cag ccg aat 168 3 

Asn Val Lys Gly Glu Asn lie Thr Asn Ala Pro Glu Leu Gin Pro Asn 
545 550 555 

get gtc act tgg ggc ate ttc cct ggg cga gag ate ate cag ccc acc 173 1 

Ala Val Thr Trp Gly lie Phe Pro Gly Arg Glu lie lie Gin Pro Thr 
560 565 570 

gta gtg gat ccc gtc age ttc atg ttc tgg aag gac gag gec ttt gec 177 9 

Val Val Asp Pro Val Ser Phe Met Phe Trp Lys Asp Glu Ala Phe Ala 
575 580 585 

ctg tgg att gag egg tgg gga aag ctg tat gag gag gag tec ccg tec 182 7 

Leu Trp Tie Glu Arg Trp Gly Lys Leu Tyr Glu Glu Glu Ser Pro Ser 
590 595 600 605 

cgc acc ate ate cag tac ate cac gac aac tac ttc ctg gtc aac ctg 1875 
Arg Thr lie lie Gin Tyr lie His Asp Asn Tyr Phe Leu Val Asn Leu 
610 615 620 

gtg gac aat gac ttc cca ctg gac aac tgc etc tgg cag gtg gtg gaa 192 3 

Val Asp Asn Asp Phe Pro Leu Asp Asn Cys Leu Trp Gin Val Val Glu 
625 630 635 

gac aca ttg gag ctt etc aac agg ccc acc cag aat gcg aga gaa acg 197 1 

Asp Thr Leu Glu Leu Leu Asn Arg Pro Thr Gin Asn Ala Arg Glu Thr 
640 645 650 

gag get cca tga ccctgcgtcc tgacgccctg cgttggagcc actcctgtcc 2 02 3 

Glu Ala Pro * 
655 

cgccttcctc ctccacagtg ctgcttctct tgggaactcc actctccttc gtgtctctcc 2 08 3 

caccccggcc tccactcccc cacctgacaa tggcagctag actggagtga ggcttccagg 2 143 

ctcttcctgg acctgagtcg gccccacatg ggaacctagt actctctgct eta 2 196 

<210> SEQ ID HO 3 4 

<211> LENGTH: 65 6 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<4 0 0> SEQUENCE: 3 4 

Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro Cys Leu Glu 
15 10 15 

Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser Lys Asp Ser Ser Arg Cys 
20 25 30 

Ser Thr Pro Gly Leu Asp Pro Glu Arg His Glu Arg Leu Arg Glu Lys 
35 40 45 

Met Arg Arg Arg Leu Glu Ser Gly Asp Lys Trp Phe Ser Leu Glu Phe 
50 55 60 


Phe Pro Pro Arg Thr Ala Glu Gly Ala Val Asn Leu lie Ser Arg Phe 
65 70 75 80 
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Asp Arg Met Ala Ala Gly Gly Pro Leu Tyr lie Asp Val Thr Trp His 

85 90 95 

Pro Ala Gly Asp Pro Gly Eer Asp Lys Glu Thr Ser Ser Met Met lie 
100 105 110 

Ala Ser Thr Ala Val Asn Tyr Cys Gly Leu Glu Thr lie Leu His Met 
115 120 125 

Thr Cys Cys Arg Gin Arg Leu Glu Glu lie Thr Gly His Leu His Lys 
130 135 140 

Ala Lys Gin Leu Gly Leu Lys Asn lie Met Ala Leu Arg Gly Asp Pro 
145 150 155 160 

He Gly Asp Gin Trp Glu Glu Glu Glu Gly Gly Phe Asn Tyr Ala Val 
165 170 175 

Asp Leu Val Lys His He Arg Ser Glu Phe Gly Asp Tyr Phe Asp He 
ISO 185 190 

Cys Val Ala Gly Tyr Pro Lys Gly His Pro Glu Ala Gly Ser Phe Glu 
195 200 205 

Ala Asp Leu Lys His Leu Lys Glu Lys Val Ser Ala Gly Ala Asp Phe 
210 215 220 

lie lie Thr Gin Leu Phe Phe Glu Ala Asp Thr Phe Phe Arg Phe Val 
225 230 235 240 

Lys Ala Cys Thr Asp Met Gly He Thr Cys Pro He Val Pro Gly He 

245 250 255 

Phe Pro He Gin Gly Tyr His Ser Leu Arg Gin Leu Val Lys Leu Ser 
260 265 270 

Lys Leu Glu Val Pro Gin Glu He Lys Asp Val He Glu Pro He Lys 
275 2S0 285 

Asp Asn Asp Ala Ala He Arg Asn Tyr Gly He Glu Leu Ala Val Ser 
290 295 300 

Leu Cys Gin Glu Leu Leu Ala Ser Gly Leu Val Pro Gly Leu His Phe 
305 310 315 320 

Tyr Thr Leu Asn Arg Glu Met Ala Thr Thr Glu Val Leu Lys Arg Leu 
325 330 335 

Gly Met Trp Thr Glu Asp Pro Arg Arg Pro Leu Pro Trp Ala Leu Ser 
340 345 350 

Ala His Pro Lys Arg Arg Glu Glu Asp Val Arg Pro He Phe Trp Ala 
355 360 365 

Ser Arg Pro Lys Ser Tyr lie Tyr Arg Thr Gin Glu Trp Asp Glu Phe 
370 375 380 

Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala Phe Gly Glu Leu 
385 390 395 400 

Lys Asp Tyr Tyr Leu Phe Tyr Leu Lys Ser Lys Ser Pro Lys Glu Glu 
405 410 415 

Leu Leu Lys Met Trp Gly Glu Glu Leu Thr Ser Glu Ala Ser Val Phe 

420 425 430 

Glu Val Phe Val Leu Tyr Leu Ser Gly Glu Pro Asn Arg Asn Gly His 

435 440 445 

Lys Val Thr Cys Leu Pro Trp Asn Asp Glu Pro Leu Ala Ala Glu Thr 

450 455 460 

Ser Leu Leu Lys Glu Glu Leu Leu Arg Val Asn Arg Gin Gly He Leu 
465 470 475 480 

Thr He Asn Ser Gin Pro Asn He Asn Gly Lys Pro Ser Ser Asp Pro 
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He Val Gly Trp Gly Pro Ser Gly Gly Tyr Val Phe Gin Lys Ala Tyr 
500 505 510 

Leu Glu Phe Phe Thr Ser Arg Glu Thr Ala Glu Ala Leu Leu Gin Val 
515 520 525 

Leu Lys Lys Tyr Glu Leu Arg Val Asn Tyr His Leu Val Asn Val Lys 
530 535 540 

Gly Glu Asn He Thr Asn Ala Pro Glu Leu Gin Pro Asn Ala Val Thr 
545 550 555 560 

Trp Gly lie Phe Pro Gly Arg Glu He He Gin Pro Thr Val Val Asp 
565 570 575 

Pro Val Ser Phe Met Phe Trp Lys Asp Glu Ala Phe Ala Leu Trp He 
580 585 590 

Glu Arg Trp Gly Lys Leu Tyr Glu Glu Glu Ser Pro Ser Arg Thr He 
595 600 605 

He Gin Tyr He His Asp Asn Tyr Phe Leu Val Asn Leu Val Asp Asn 
610 615 620 

Asp Phe Pro Leu Asp Asn Cys Leu Trp Gin Val Val Glu Asp Thr Leu 
625 630 635 640 

Glu Leu Leu Asn Arg Pro Thr Gin Asn Ala Arg Glu Thr Glu Ala Pro 
645 650 655 

<210> SEQ ID NO 35 

<211> LENGTH: 333 4 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: ( 1 1 7 ) . . . < 19 49 ) 

<223> OTHER IK FORMAT I OK : Nucleotide sequence encoding select in E (SELE) 
<400> SEQUENCE: 35 

cctgagacag aggcagcagt gatacccacc tgagagatcc tgtgtttgaa caactgcttc 6C 

ccaaaacgga aagtatttca agcctaaacc tttgggtgaa aagaactctt gaagtc atg 11S 

Met 
1 

att get tea cag ttt etc tea get etc act ttg gtg ctt etc att aaa 16 7 

lie Ala Ser Gin Phe Leu Ser Ala Leu Thr Leu Val Leu Leu lie Lys 
5 10 15 

gag agt gga gee tgg tct tac aac ace tec acg gaa get atg act tat 215 
Glu Ser Gly Ala Trp Ser Tyr Asn Thr Ser Thr Glu Ala Met Thr Tyr 
20 25 30 

gat gag gec agt get tat tgt cag caa agg tac aca cac ctg gtt gca 26 2 

Asp Glu Ala Ser Ala Tyr Cys Gin Gin Arg Tyr Thr His Leu Val Ala 
35 40 45 

att caa aac aaa gaa gag att gag tac eta aac tec ata ttg age tat 311 
He Gin Asn Lys Glu Glu He Glu Tyr Leu Asn Ser He Leu Ser Tyr 
50 55 60 65 

tea cca agt tat tac tgg att gga ate aga aaa gtc aac aat gtg tgg 35 S 

Ser Pro Ser Tyr Tyr Trp He Gly He Arg Lys Val Asn Asn Val Trp 
70 75 80 

gtc tgg gta gga ace cag aaa act ctg aca gaa gaa gec aag aac tgg 40 7 

Val Trp Val Gly Thr Gin Lys Pro Leu Thr Glu Glu Ala Lys Asn Trp 


get cca ggt gaa ccc aac aat agg caa aaa gat gag gac tgc gtg gag 
Ala Pro Gly Glu Pro Asn Asn Arg Gin Lys Asp Glu Asp Cys Val Glu 


455 
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ate tac ate aag aga gaa aaa gat gtg ggc atg tgg aat gat gag agg 

lie Tyr lie Lys Arg Glu Lys Asp Val Gly Met Trp Aen Asp Glu Arg 

115 120 125 

tgc age aag aag aag ctt gec eta tgc tac aca get gec tgt acc aat 

Cys Ser Lys Lys Lys Leu Ala Leu Cys Tyr Thr Ala Ala Cys Thr Asn 

130 135 140 145 

aca tec tgc agt ggc cac ggt gaa tgt gta gag acc ate aat aat tac 

Thr Ser Cys Ser Gly His Gly Glu Cys Val Glu Thr lie Asn Asn Tyr 

150 155 160 

act tgc aag tgt gac cct ggc ttc agt gga etc aag tgt gag caa att 

Thr Cys Lys Cys Asp Pro Gly Phe Ser Gly Leu Lys Cys Glu Gin lie 

165 170 175 

gtg aac tgt aca gec ctg gaa tec cct gag cat gga age ctg gtt tgc 

Val Asn Cys Thr Ala Leu Glu Ser Pro Glu His Gly Ser Leu Val Cys 

180 185 190 

agt cac cca ctg gga aac ttc age tac aat tct tec tgc tct ate age 

Ser His Pro Leu Gly Asn Phe Ser Tyr Asn Ser Ser Cys Ser lie Ser 

195 200 205 

tgt gat agg ggt tac ctg cca age age atg gag acc atg cag tgt atg 

Cys Asp Arg Gly Tyr Leu Pro Ser Ser Met Glu Thr Met Gin Cys Met 

210 215 220 225 

tec tct gga gaa tgg agt get cct att cca gee tgc aat gtg gtt gag 

Ser Ser Gly Glu Trp Ser Ala Pro lie Pro Ala Cys Asn Val Val Glu 

230 235 240 

tgt gat get gtg aca aat cca gec aat ggg ttc gtg gaa tgt ttc caa 

Cys Asp Ala Val Thr Asn Pro Ala Asn Gly Phe Val Glu Cys Phe Gin 

245 250 255 

aac cct gga age ttc cca tgg aac aca acc tgt aca ttt gac tgt gaa 

Asn Pro Gly Ser Phe Pro Trp Asn Thr Thr Cys Thr Phe Asp Cys Glu 

260 265 270 

gaa gga ttt gaa eta atg gga gec cag age ctt cag tgt acc tea tct 

Glu Gly Phe Glu Leu Met Gly Ala Gin Ser Leu Gin Cys Thr Ser Ser 

275 280 285 

ggg aat tgg gac aac gag aag cca acg tgt aaa get gtg aca tgc agg 

Gly Asn Trp Asp Asn Glu Lys Pro Thr Cys Lys Ala Val Thr Cys Arg 

290 295 300 305 

gec gtc egc cag cct cag aat ggc tct gtg agg tgc age cat tec cct 

Ala Val Arg Gin Pro Gin Abu Gly Ser Val Arg Cys Ser His Ser Pro 

310 315 320 

get gga gag ttc ace ttc aaa tea tec tgc aac ttc acc tgt gag gaa 

Ala Gly Glu Phe Thr Phe Lys Ser Ser Cys Asn Phe Thr Cys Glu Glu 

325 330 335 

ggc ttc atg ttg cag gga cca gec cag gtt gaa tgc acc act caa ggg 

Gly Phe Met Leu Gin Gly Pro Ala Gin Val Glu Cys Thr Thr Gin Gly 

340 345 350 

cag tgg aca cag caa ate cca gtt tgt gaa get ttc cag tgc aca gec 

Gin Trp Thr Gin Gin lie Pro Val Cys Glu Ala Phe Gin Cys Thr Ala 

355 360 365 

ttg tec aac ccc gag cga ggc tac atg aat tgt ctt cct agt get tct 

Leu Ser Asn Pre- Glu Arg Gly Tyr Met Asn Cys Leu Pro Ser Ala Ser 

370 375 380 385 

ggc agt ttc egt tat ggg tec age tgt gag ttc tec tgt gag cag ggt 

Gly Ser Phe Arg Tyr Gly Ser Ser Cys Glu Phe Ser Cys Glu Gin Gly 

390 395 400 

ttt gtg ttg aag gga tec aaa agg etc caa tgt ggc ccc aca ggg gag 

Phe Val Leu Lys Gly Ser Lys Arg Leu Gin Cys Gly Pro Thr Gly Glu 
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tgg gac aac gag aag ccc aca tgt gaa get gtg aga tgc gat get gtc 

Trp Asp Asn. Glu Lys Pro Thr Cys Glu Ala Val Arg Cys Asp Ala Val 

420 425 430 

cac cag ccc ccg aag ggt "ttg gtg agg tgt get cat tec cct att gga 

His Gin Pro Pro Lys Gly Leu Val Arg Cys Ala His Ser Pro lie Gly 

435 440 445 

gaa ttc acc tac aag tec tct tgt gec ttc age tgt gag gag gga ttt 

Glu Phe Thr Tyr Lys Ser Ser Cys Ala Phe Ser Cys Glu Glu Gly Phe 

450 455 460 465 

gaa tta tat gga tea act caa ctt gag tgc aca tct cag gga caa tgg 

Glu Leu Tyr Gly Ser Thr Gin. Leu Glu Cys Thr Ser Gin Gly Gin Trp 

470 475 480 

aca gaa gag gtt cct tec tgc caa gtg gta aaa tgt tea age ctg gca 

Thr Glu Glu Val Pro Ser Cys Gin Val Val Lys Cys Ser Ser Leu Ala 

485 490 495 

gtt ccg gga aag ate aac atg age tge agt ggg gag ccc gtg ttt ggc 

Val Pro Gly Lye lie Asn Met Ser Cys Ser Gly Glu Pro Val Phe Gly 

500 505 510 

act gtg tgc aag ttc gec tgt cct gaa gga tgg acg etc aat ggc tct 

Thr Val Cys Lys Phe Ala Cys Pro Glu Gly Trp Thr Leu Asn Gly Ser 

515 520 525 

gca get egg aca tgt gga gee aca gga cac tgg tct ggc ctg eta cct 

Ala Ala Arg Thr Cys Gly Ala Thr Gly His Trp Ser Gly Leu Leu Pro 

530 535 540 545 

acc tgt gaa get ccc act gag tec aac att ccc ttg gta get gga ctt 

Thr Cys Glu Ala Pro Thr Glu Ser Asn lie Pro Leu Val Ala Gly Leu 

550 555 560 

tct get get gga etc tec etc ctg aca tta gca cca ttt etc etc tgg 

Ser Ala Ala Gly Leu Ser Leu Leu Thr Leu Ala Pro Phe Leu Leu Trp 

565 570 575 

ctt egg aaa tgc tta egg aaa gca aag aaa ttt gtt cct gee age age 

Leu Arg Lys Cys Leu Arg Lys Ala Lys Lys Phe Val Pro Ala Ser Ser 

580 585 590 

tgc caa age ctt gaa tea gac gga age tac caa aag cct tct tac ate 

Cys Gin Ser Leu Glu Ser Asp Gly Ser Tyr Gin Lys Pro Ser Tyr lie 

595 600 605 

ctt taa gttcaaaaga atcagaaaca ggtgcatctg gggaactaga gggatacact 

610 


gaagttaaca 

gagacagata 

actctcctcg 

ggtctctggc 

ccttcttgcc 

tactatgeca 

2059 

gatgecttta 

tggctgaaac 

cgcaacaccc 

atcaccactt 

caatagatca 

aagtccagca 

2 119 

ggcaaggacg 

gccttcaact 

gaaaagactc 

agtgttccct 

ttcctactct 

caggatcaag 

2 179 

aaagtgttgg 

ctaatgaagg 

gaaaggat at 

tttcttccaa 

gcaaaggtga 

agagaccaag 

2239 

actctgaaat 

ctcagaattc 

cttttctaac 

tctcccttgc 

tcgctgtaaa 

atcttggcac 

2299 

agaaacacaa 

tattttgtgg 

ctttctttet 

tttgcccttc 

acagtgtttc 

gacagctgat 

2359 

tacacagttg 

ctgtcataag 

aatgaataat 

aattatccag 

agtttagagg 

aaaaaaatga 

2419 

ctaaaaatat 

tataacttaa 

aaaaatgaca 

gatgttgaat 

gcccacagge 

aaatgcatgg 

2479 

agggttgtta 

atggtgcaaa 

tcctactgaa 

tgctctgtgc 

gagggttact 

atgeacaatt 

2539 

taatcacttt 

catccctatg 

ggattcagtg 

cttcttaaag 

agttcttaag 

gattgtgata 

2599 

tttttacttg 

cattgaatat 

attataatct 

tccatacttc 

ttcattcaat 

acaagtgtgg 

2659 
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tagggactta 

aaaaacttgt 

aaatgctgtc 

aactatgata 

tggtaaaagt 

tacttattct 

2719 

agattacccc 

ctcattgttt 

attaacaaat 

tatgttacat 

ctgttttaaa 

tttatttcaa 

2779 

aaagggaaac 

tattgtcccc 

tagcaaggca 

tgatgttaac 

cagaataaag 

ttctgagtgt 

2839 

ttttactaca 

gttgtttttt 

gaaaacatgg 

tagaattgga 

gagtaaaaac 

tgaatggaag 

2899 

gtttgtatat 

tgtcagatat 

tttttcagaa 

atatgtggtt 

tccacgatga 

aaaacttcca 

2959 

tgaggccaaa 

cgttttgaac 

taataaaagc 

ataaatgcaa 

acacacaaag 

gtataatttt 

3019 

atgaatgtct 

ttgttggaaa 

agaatacaga 

aagatggatg 

tgctttgcat 

tcctacaaag 

3079 

atgtttgtca 

gat g-tgatat 

gtaaacataa 

ttcttgtata 

ttatggaaga 

ttttaaattc 

3 139 

acaatagaaa 

ctcaccatgt 

aaaagagtca 

tctggtagat 

ttttaacgaa 

tgaagatgtc 

3 199 

taatagttat 

tccctatttg 

ttttcttctg 

tatgttaggg 

tgctctggaa 

gagaggaatg 

3259 

cctgtgtgag 

caagcattta 

tgtttattta 

taagcagatt 

taacaattcc 

aaaggaatct 

3319 

ccagttttca 

gttgatcact 

ggcaatgaaa 

aattntcagt 

cagtaattgc 

caaagctgct 

3379 

ctagccttga 

ggagtgtgag 

aatcaaaact 

ctcctacact 

tccattaact 

tagcatgtgt 

3439 

tgaaaaaaaa 

agtttcagag 

aagttctggc 

tgaacactgg 

caacgacaaa 

gccaacagtc 

3499 

aaaacagaga 

tgtgataagg 

atcagaanag 

cagaggttct 

tttaaagggg 

cagaaaaact 

3559 

ctgggaaata 

agagagaaca 

actactgtga 

tcaggctatg 

tatggaatac 

agtgttattt 

3619 

tctttgaaat 

tgtttaagtg 

ttgtaaat at 

ttatgtaaac 

tgcattagaa 

attagctgtg 

3679 

tgaaatacca 

gtgtggtttg 

tgtttgagtt 

ttattgagaa 

ttttaaatta 

taacttaaaa 

3739 

tattttataa 

tttttaaagt 

atatatttat 

ttaagcttat 

gtcagaccta 

tttgacataa 

3799 

cactataaag 

gttgacaata 

aatgtgctta 

tgttt 



3834 


<210> SEQ ID NO 3 6 

<211> LENGTH: 610 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 

<400> SEQUENCE: 3 6 

Met He Ala Ser Gin Phe Leu Ser Ala Leu Thr Leu Val Leu Leu He 
15 10 15 

Lys Glu Ser Gly Ala Trp Ser Tyr Asn Thr Ser Thr Glu Ala Met Thr 
20 25 30 

Tyr Asp Glu Ala Ser Ala Tyr Cys Gin Gin Arg Tyr Thr His Leu Val 
35 40 45 

Ala lie Gin Asn Lys Glu Glu He Glu Tyr Leu Asn Ser He Leu Ser 
50 55 60 

Tyr Ser Pro Ser Tyr Tyr Trp He Gly He Arg Lys Val Asn Asn Val 
65 70 75 80 

Trp Val Trp Val Gly Thr Gin Lys Pro Leu Thr Glu Glu Ala Lys Asn 


Trp Ala Pro Gly Glu Pro Asn Asn Arg Gin Lys Asp Glu Asp Cys Val 
100 105 110 

Glu He Tyr He Lys Arg Glu Lys Asp Val Gly Met Trp Asn Asp Glu 
115 120 125 

Arg Cys Ser Lys Lys Lys Leu Ala Leu Cys Tyr Thr Ala Ala Cys Thr 
130 135 140 

Asn Thr Ser Cys Ser Gly His Gly Glu Cys Val Glu Thr He Asn Asn 
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Tyr Thr Cys Lys Cys Asp Pro Gly Phe Ser Gly Leu Lys Cys Glu Gin 
165 170 175 

lie Val Asn Cys Thr Ala Leu Glu Ser Pro Glu His Gly Ser Leu Val 
180 185 190 

Cys Ser His Pro Leu Gly Asn Phe Ser Tyr Asn Ser Ser Cys Ser lie 
195 200 205 

Ser Cys Asp Arg Gly Tyr Leu Pro Ser Ser Met Glu Thr Met Gin Cys 
210 215 220 

Met Ser Ser Gly Glu Trp Ser Ala Pro lie Pro Ala Cys Asn Val Val 
225 230 235 240 

Glu Cys Asp Ala Val Thr Asn Pro Ala Asn Gly Phe Val Glu Cys Phe 
245 250 255 

Gin Asn Pro Gly Ser Phe Pro Trp Asn Thr Thr Cys Thr Phe Asp Cys 
260 265 270 

Glu Glu Gly Phe Glu Leu Met Gly Ala Gin Ser Leu Gin Cys Thr Ser 
275 2S0 285 

Ser Gly Asn Trp Asp Asn Glu Lys Pro Thr Cys Lys Ala Val Thr Cys 
290 295 300 

Arg Ala Val Arg Gin Pro Gin Asn Gly Ser Val Arg Cys Ser His Ser 
305 310 315 320 

Pro Ala Gly Glu Phe Thr Phe Lys Ser Ser Cys Asn Phe Thr Cys Glu 
325 330 335 

Glu Gly Phe Met Leu Gin Gly Pro Ala Gin Val Glu Cys Thr Thr Gin 
340 345 350 

Gly Gin Trp Thr Gin Gin lie Pro Val Cys Glu Ala Phe Gin Cys Thr 
355 360 365 

Ala Leu Ser Asn Pro Glu Arg Gly Tyr Met Asn Cys Leu Pro Ser Ala 
370 375 380 

Ser Gly Ser Phe Arg Tyr Gly Ser Ser Cys Glu Phe Ser Cys Glu Gin 
385 390 395 400 

Gly Phe Val Leu Lys Gly Ser Lys Arg Leu Gin Cys Gly Pro Thr Gly 
405 410 415 

Glu Trp Asp Asn Glu Lys Pro Thr Cys Glu Ala Val Arg Cys Asp Ala 
420 425 430 

Val His Gin Pro Pro Lys Gly Leu Val Arg Cys Ala His Ser Pro lie 

435 440 445 

Gly Glu Phe Thr Tyr Lys Ser Ser Cys Ala Phe Ser Cys Glu Glu Gly 
450 455 460 

Phe Glu Leu Tyr Gly Ser Thr Gin Leu Glu Cys Thr Ser Gin Gly Gin 
465 470 475 480 

Trp Thr Glu Glu Val Pro Ser Cys Gin Val Val Lys Cys Ser Ser Leu 
485 490 495 

Ala Val Pro Gly Lys lie Asn Met Ser Cys Ser Gly Glu Pro Val Phe 
500 505 510 

Gly Thr Val Cys Lys Phe Ala Cys Pro Glu Gly Trp Thr Leu Asn Gly 
515 520 525 

Ser Ala Ala Arg Thr Cye Gly Ala Thr Gly His Trp Ser Gly Leu Leu 
530 535 540 

Pro Thr cys Glu Ala Pro Thr Glu ser Asn lie Pro Leu val Ala Gly 
545 550 555 560 
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Leu Ser Ala Ala Gly Leu Ser Leu Leu Thr Leu Ala Pro Phe Leu Leu 
565 570 575 


Trp Leu Arg Lys Cys Leu Arg Lys Ala Lys Lys Phe Val Pro Ala Ser 
580 585 590 


Ser Cys Gin Ser Leu Glu Ser Asp Gly Ser Tyr Gin Lys Pro Ser Tyr 
595 600 605 


He Leu 
610 


<210> SEQ ID HO 3 7 

<211> LENGTH: 192 2 

<212> TYPE: DHA 

<213> ORGANISM: Homo sapien 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<2 22> LOCATION: ( 406 )...( 1428 ) 

<223> OTHER IK FORMAT I OK : Nucleotide sequence encoding nucleotide binding 
protein (G Protein), beta polypeptide 3 (GTJB3) 

<400> SEQUENCE: 3 7 

ccacaatagg ggcagacctg tccatccttc tctgtgggtc ccctgtacct ttctccccca 6 0 

acaggatcag acocagaggc agctggttgg ggtttgtcga gaagaaggat tatccagatc 

agtcctttct aatctcagct cctgcctgta ccctcccata ctcaccaaac cctcttcccc 

accaccctga gctgaggagc acagtttgag gcccccccaa ccccccgccg gtcggggcca 

ggccaggcca ggccagctcc tctggcagca gagcctgggc aggtgacggg cgggcgcggg 

cgtcgcagct gagggagtaa ggaggctccc aggaaccgga gctggaaacc cggccgaggt 


ccagccagag cccaagagcc agagtgacc 


ctcgacctgt cagcc atg ggg gag atg 
Met Gly Glu Met 


120 
180 
240 
300 
360 
417 


gag caa ctg cgt cag gaa gcg gag cag etc aag aag cag att gca gat 
Glu Gin Leu Arg Gin Glu Ala Glu Gin Leu Lys Lys Gin He Ala Asp 


gec agg aaa gee tgt get gac gtt act ctg gca gag ctg gtg tct ggc 
Ala Arg Lys Ala Cys Ala Asp Val Thr Leu Ala Glu Leu Val Ser Gly 


eta gag gtg gtg gga cga gtc cag atg egg acg egg egg acg tta agg 
Leu Glu Val Val Gly Arg Val Gin Met Arg Thr Arg Arg Thr Leu Arg 


gga cac ctg gee aag att tac gee atg cac tgg gec act gat tct aag 
Gly His Leu Ala Lys He Tyr Ala Met His Trp Ala Thr Asp Ser Lys 


ctg ctg gta agt gec teg caa gat ggg aag ctg ate gtg tgg gac age 
Leu Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie Val Trp Asp Ser 


tac ace acc aac aag gtg cac gec ate cca ctg cgc tec tec tgg gtc 

Tyr Thr Thr Asn Lys Val His Ala He Pro Leu Arg Ser Ser Trp Val 

85 90 95 100 

atg acc tgt gec tat gec cca tea ggg aac ttt gtg gca tgt ggg ggg 

Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Phe Val Ala Cys Gly Gly 

105 110 115 


ctg gac aac atg tgt tec ate tac aac etc aaa tec cgt gag ggc aat 
Leu Asp Asn Met Cys Ser He Tyr Asn Leu Lys Ser Arg Glu Gly Asn 
120 125 130 


gtc aag gtc age egg gag ctt tct get cac aca ggt tat etc tec tgc 
Val Lye Val Ser Arg Glu Leu Ser Ala His Thr Gly Tyr Leu Ser Cys 


849 
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tgc cgc ttc ctg gat gac aac aat att gtg acc age teg ggg gac acc 89 7 

Cys Arg Phe Leu Asp Asp Asn Asn lie Val Thr Ser Ser Gly Asp Thr 

150 155 ieo 

acg tgt gec ttg tgg gac att gag act ggg nag cag aag act gta ttt 945 

Thr Cys Ala Leu Trp Asp He Glu Thr Gly Gin Gin Lys Thr Val Phe 

165 170 175 180 

gtg gga cac acg ggt gac tgc atg age ctg get gtg tct cct gac ttc 99 3 

Val Gly His Thr Gly Asp Cys Met Ser Leu Ala Val Ser Pro Asp Phe 

185 190 195 

aat etc ttc att teg ggg gec tgt gat gec agt gee aag etc tgg gat 104 1 

Asn Leu Phe He Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Trp Asp 
200 205 210 

gtg cga gag ggg acc tgc cgt cag act ttc act ggc cac gag teg gac 108 9 

Val Arg Glu. Gly Thr Cye Arg Gin Thr Phe Thr Gly His Glu Ser Asp 
215 220 225 

ate aac gec ate tgt ttc ttc ccc aat gga gag gec ate tgc acg ggc 1137 

lie Asn Ala He Cys Phe Phe Pro Asn Gly Glu Ala He Cys Thr Gly 
230 235 240 

teg gat gac get tec tgc cgc ttg ttt gac ctg egg gca gac cag gag 1185 

Ser Asp Asp Ala Ser Cys Arg Leu Phe Asp Leu Arg Ala Asp Gin Glu 

245 250 255 260 

ctg ate tgc ttc tec cac gag age ate ate tgc ggc ate acg tec gtg 123 3 

Leu He Cys Phe Ser His Glu Ser He He Cys Gly He Thr Ser Val 
265 270 275 

gec ttc tec etc agt ggc cgc eta eta ttc get ggc tac gac gac ttc 128 1 

Ala Phe Ser Leu Ser Gly Arg Leu Leu Phe Ala Gly Tyr Asp Asp Phe 
280 285 290 

aac tgc aat gtc tgg gac tec atg aag tct gag cgt gtg ggc ate etc 132 9 

Asn Cys Asn Val Trp Asp Ser Met Lys Ser Glu Arg Val Gly He Leu 
295 300 305 

tct ggc cac gat aac agg gtg age tgc ctg gga gtc aca get gac ggg 137 7 

Ser Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Ala Asp Gly 
310 315 320 

atg get gtg gee aca ggt tec tgg gac age ttc etc aaa ate tgg aac 1425 

Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys He Trp Asn 

325 330 335 340 

tga ggaggctgga gaaagggaag tggaaggcag tgaacacact cagcagcccc 147 8 

ctgcccgacc ccatctcatt caggtgttct cttctatatt ccgggtgcca ttcccactaa 153 8 

gctttctcct ttgagggcag tggggagcat gggactgtgc ctttgggagg cagcatcagg 159 8 

gacacagggg caaagaactg ccccatctcc tcccatggcc ttccctcccc acagtcctca 1658 

cagcctctcc cttaatgagc aaggacaacc tgcccctccc cagccctttg caggcccagc 1718 

agacttgagt ctgaggcccc aggecctagg attcctcccc cagagccact acctttgtcc 177 8 

aggcctgggt ggtatagggc gtttggccct gtgactatgg ctctggcacc actagggtcc 183 8 

tggecetett cttattcatg ctttctcctt tttctacctt tttttctctc ctaagacace 189 8 

tgcaataaag tgtagcaccc tggt 1922 

<210> SEQ ID HO 3 8 

<211> LENGTH: 340 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 


<4 0 0> SEQUENCE: 3 3 
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Met Gly Glu Met Glu Gin Leu Arg Gin Glu Ala Glu Gin Leu Lys Lys 

15 10 15 

Gin He Ala Asp Ala Arg Lys Ala Cys Ala Asp Val Thr Leu Ala Glu 
20 25 30 

Leu Val Ser Gly Leu Glu Val Val Gly Arg Val Gin Met Arg Thr Arg 
35 40 45 

Arg Thr Leu Arg Gly Hie Leu Ala Lys lie Tyr Ala Met His Trp Ala 
50 55 60 

Thr Asp Ser Lys Leu Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie 
65 70 75 80 

Val Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala He Pro Leu Arg 


Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Phe Val 
100 105 110 

Ala Cys Gly Gly Leu Asp Asn Met Cys Ser lie Tyr Asn Leu Lys Ser 
115 120 125 

Arg Glu Gly Asn Val Lys Val Ser Arg Glu Leu Ser Ala His Thr Gly 
130 135 140 

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Asn He Val Thr Ser 
145 150 155 160 

Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp He Glu Thr Gly Gin Gin 

165 170 175 

Lys Thr Val Phe Val Gly His Thr Gly Asp Cys Met Ser Leu Ala Val 
180 185 190 

Ser Pro Asp Phe Asn Leu Phe He Ser Gly Ala Cys Asp Ala Ser Ala 
195 200 205 

Lys Leu Trp Asp Val Arg Glu Gly Thr Cys Arg Gin Thr Phe Thr Gly 
210 215 220 

His Glu Ser Asp He Asn Ala He Cys Phe Phe Pro Asn Gly Glu Ala 
225 230 235 240 

He Cys Thr Gly Ser Asp Asp Ala Ser Cys Arg Leu Phe Asp Leu Arg 
245 250 255 

Ala Asp Gin Glu Leu He Cys Phe Ser His Glu Ser He He Cys Gly 
260 265 270 

He Thr Ser Val Ala Phe Ser Leu Ser Gly Arg Leu Leu Phe Ala Gly 
275 280 285 

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Ser Met Lys Ser Glu Arg 
290 295 300 

Val Gly lie Leu Ser Gly His Asp Asn Arg Val Ser Cys Leu Gly Val 
305 310 315 320 

Thr Ala Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu 
325 330 335 


Lys He Trp Asn 

340 


<210> SEQ ID NO 3 9 

<211> LEMGTH: 244 3 

<212> TYPE: DHA 

<213> ORGANISM: Homo sapien 

<2 2 0> FEATURE: 

<221> NAME/KEY : CDS 

<2 22> LOCATIONS (162). ..(12 53) 

<223> OTHER IH FORMAT l OH : nucleotide sequence encoding angiotensin 
receptor 2 (AGTR2) 
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<4 0 0> SEQUENCE: 3 9 

acgtcccagc gtctgagaga acgagtaagc aagaattcaa agcattctgc agcctgaatt 6 0 

ttgaaggagt gtgtttaggc actaagcaag ctgatttatg ataactgctt taaacttcaa 12 0 

caaccaaagg cataagaact aggagctgct gacatttcaa t atg aag ggc aac tec 17 6 

Met Lys Gly Asn Ser 

1 5 

acc ctt gec act act age aaa aac att acc age ggt ctt cac ttc ggg 22 4 

Thr Leu Ala Thr Thr Ser Lys Asn lie Thr Ser Gly Leu His Phe Gly 

10 15 20 

ctt gtg aac ate tct ggc aac aat gag tet acc ttg aac tgt tea cag 272 

Leu Val Asn. lie Ser Gly Asn Asn Glu Ser Thr Leu Asn Cys Ser Gin 

25 30 35 

aaa cca tea gat aag cat tta gat gca att cct att ctt tac tac att 32 0 

Lys Pro Ser Asp Lys His Leu Asp Ala lie Pro lie Leu Tyr Tyr lie 

40 45 50 

ata ttt gta att gga ttt ctg gtc aat att gtc gtg gtt aca ctg ttt 36 8 

lie Phe Val lie Gly Phe Leu Val Asn lie Val Val Val Thr Leu Phe 

55 60 65 

tgt tgt caa aag ggt cct aaa aag gtt tct age ata tac ate ttc aac 416 

Cys Cys Gin Lys Gly Pro Lys Lys Val Ser Ser Tie Tyr Tie Phe Asn 

70 75 SO 85 

etc get gtg get gat tta etc ctt ttg get act ctt cct eta tgg gca 46 4 

Leu Ala Val Ala Asp Leu Leu Leu Leu Ala Thr Leu Pro Leu Trp Ala 

90 95 100 

acc tat tat tct tat aga tat gac tgg etc ttt gga cct gtg atg tgc 512 

Thr Tyr Tyr Ser Tyr Arg Tyr Asp Trp Leu Phe Gly Pro Val Met Cys 

105 110 115 

aaa gtt ttt ggt tet ttt ctt acc ctg aac atg ttt gca age att ttt 56 0 

Lys Val Phe Gly Ser Phe Leu Thr Leu Asn Met Phe Ala Ser lie Phe 

120 125 130 

ttt ate acc tgc atg agt gtt gat agg tac caa tct gtc ate tac ccc 60 8 

Phe lie Thr Cys Met Ser Val Asp Arg Tyr Gin Ser Val lie Tyr Pro 

135 140 145 

ttt ctg tct caa aga aga aat ccc tgg caa gca tct tat ata gtt ccc 65 6 

Phe Leu Ser Gin Arg Arg Asn Pro Trp Gin Ala Ser Tyr lie Val Pro 

150 155 ISO 165 

ctt gtt tgg tgt atg gec tgt ttg tec tea ttg cca aca ttt tat ttt 70 4 

Leu Val Trp Cys Met Ala Cys Leu Ser Ser Leu Pro Thr Phe Tyr Phe 

170 175 ISO 

cga gac gtc aga acc att gaa tac tta gga gtg aat get tgc att atg 752 

Arg Asp Val Arg Thr lie Glu Tyr Leu Gly Val Asn Ala Cys lie Met 

185 190 195 

get ttc cca cct gag aaa tat gec caa tgg tea get ggg att gee tta 80 0 

Ala Phe Pro Pro Glu Lys Tyr Ala Gin Trp Ser Ala Gly lie Ala Leu 

200 205 210 

atg aaa aat ate ctt ggt ttt att ate cct tta ata ttc ata gca aca 84 8 

Met Lys Asn lie Leu Gly Phe lie lie Pro Leu He Phe He Ala Thr 

215 220 225 

tgc tat ttt gga att aga aaa cae tta ctg aag acg aat age tat ggg 89 6 

Cys Tyr Phe Gly lie Arg Lys His Leu Leu Lys Thr Asn Ser Tyr Gly 

230 235 240 245 

aag aac agg ata acc cgt gac caa gtc ctg aag atg gca get get gtt 94 4 

Lys Asn Arg He Thr Arg Asp Gin Val Leu Lys Met Ala Ala Ala Val 

250 255 260 


gtt ctg gee ttc ate att tgg tgc ctt ccc ttc cat gtt ctg acc ttc 


992 
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Val Leu Ala Phe lie lie Trp Cys Leu Pro Phe His Val Leu Thr Phe 

265 270 275 

ctg gat get ctg gec tgg atg ggt gtc att aat age tgc gaa gtt ata 

Leu Asp Ala Leu Ala Trp Met Gly Val lie Asn Ser Cys Glu Val lie 

280 285 290 

gca gtc att gac ctg gca ctt cct ttt gec ate etc ttg gga ttc acc 

Ala Val lie Asp Leu Ala Leu Pro Phe Ala lie Leu Leu Gly Phe Thr 

295 300 305 

aac age tgc gtt aat ccg ttt ctg tat tgt ttt gtt gga aac egg ttc 

Asn Ser Cys Val Asn Pro Phe Leu Tyr Cys Phe Val Gly Asn Arg Phe 

310 315 320 325 

caa Gag aag etc cgc agt gtg ttt agg gtt cca att act tgg etc caa 

Gin Gin Lys Leu Arg Ser Val Phe Arg Val Pro lie Thr Trp Leu Gin 

330 335 340 

ggg aaa aga gag agt atg tct tgc egg aaa age agt tct ctt aga gaa 

Gly Lys Arg Glu Ser Met Ser Cys Arg Lys Ser Ser Ser Leu Arg Glu 

345 350 355 

atg gag acc ttt gtg tct taa aeggagagea aaatgcatgt aatcaacatg 

Met Glu Thr Phe Val Ser * 
360 


getacttget 

ttgaggctca 

ccagaattat 

ttttaagtgg 

ttttaataaa 

ataataaaat 

1343 

ttcccctaat 

cttttctgaa 

tcttctgaaa 

ccaaatgtaa 

ctatgtttat 

cgtccagtga 

1403 

ctttcaggaa 

tgcccattgt 

tttctgatat 

gtttgtacaa 

gat ttc at tg 

gtgagacata 

1463 

tttacaacct 

agaagtaact 

ggtgatat at 

ctcaaattgt 

aattaataat 

agattgtgaa 

1523 

taatgatttg 

gggattcaga 

tttctctttg 

aaacatgett 

gtgtttctta 

gtggggtttt 

1583 

atatccattt 

ttatcaggat 

ttcctcttga 

accagaacca 

gtctttcaac 

teattgeate 

1643 

atttacaaga 

caacattgta 

agagagatga 

gcacttctaa 

gttgagtata 

ttataataga 

1703 

ttagtactgg 

att attcagg 

ctttaggcat 

atgcttcttt 

aaaaaegcta 

taaattatat 

1763 

tcctcttgca 

tttcacttga 

gtggaggttt 

atagttaatc 

tataactaca 

tattgaatag 

1823 

ggctaggaat 

atagattaaa 

tcatactcct 

atgetttage 

ttatttttac 

agttatagaa 

1883 

agcaagatgt 

act at aac at 

agaattgeaa 

tctataatat 

ttgtgtgttc 

actaaactct 

1943 

gaataagcac 

tttttaaaaa 

actttctact 

cattttaatg 

attgtttaaa 

ggtttctatt 

2003 

ttctctgata 

cttttttgaa 

atcagtaaac 

actgtgtatt 

gttgtaaaat 

gtaaaggtca 

2063 

cttttcacat 

ccttgacttt 

ttagatgtgc 

tgctttgata 

tataggacat 

tgatttgatt 

2 123 

tttattatta 

atgctttggt 

tctgggtt gt 

ttcctaaaat 

atctgggtgg 

cttaaaaaaa 

2 183 

actctttaac 

ttgtaataaa 

cccttaactg 

gcataggaaa 

tggtatccag 

aatggaattt 

2243 

tgctacatgg 

ggtctgggtg 

ggggcaaaga 

gacccagtca 

att ac at gtt 

tggtaccaag 

2303 

aaaggaacct 

gtcagggcag 

tacaatgtga 

ctttgaaaat 

atatacegtg 

ggggtagttt 

2363 

taoootatat 

ctataaacac 

tgtttgttcc 

agaatctgta 

tgattctatg 

gagctatttt 

2423 

aaaccaattg 

caggtctaga 





2443 


<210> SEQ ID HO 4 0 

<211> LENGTH: 363 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 


<4 0 0> SEQUENCES 4 0 

Met Lye Gly Asn Ser Thr Leu Ala Thr Thr Ser Lys Asn lie Thr Ser 
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15 10 15 

Gly Leu His Phe Gly Leu Val Asn lie Ser Gly Asn Asn Glu Ser Thr 
20 25 30 

Leu Asn Cys Ser Gin Lys Pro Ser Asp Lys His Leu Asp Ala He Pro 
35 40 45 

He Leu Tyr Tyr He He Phe Val He Gly Phe Leu Val Asn He Val 
50 55 60 

Val Val Thr Leu Phe Cys Cys Gin Lys Gly Pro Lys Lys Val Ser Ser 
65 70 75 80 

He Tyr lie Phe Asn Leu Ala Val Ala Asp Leu Leu Leu Leu Ala Thr 

85 90 95 

Leu Pro Leu Trp Ala Thr Tyr Tyr Ser Tyr Arg Tyr Asp Trp Leu Phe 
100 105 110 

Gly Pro Val Met Cys Lys Val Phe Gly Ser Phe Leu Thr Leu Asn Met 
115 120 125 

Phe Ala Ser He Phe Phe lie Thr Cys Met Ser Val Asp Arg Tyr Gin 
130 135 140 

Sex- Val lie Tyr Pro Phe Leu Ser Gin Arg Arg Asn Pro Trp Gin Ala 
145 150 155 160 

Ser Tyr lie Val Pro Leu Val Trp Cys Met Ala Cys Leu Ser Ser Leu 
165 170 175 

Pro Thr Phe Tyr Phe Arg Asp Val Arg Thr He Glu Tyr Leu Gly Val 
180 185 190 

Asn Ala Cys He Met Ala Phe Pro Pro Glu Lys Tyr Ala Gin Trp Ser 
195 200 205 

Ala Gly lie Ala Leu Met Lys Asn He Leu Gly Phe He He Pro Leu 
210 215 220 

lie Phe lie Ala Thr Cys Tyr Phe Gly lie Arg Lys His Leu Leu Lys 
225 230 235 240 

Thr Asn Ser Tyr Gly Lys Asn. Arg lie Thr Arg Asp Gin Val Leu Lys 
245 250 255 

Met Ala Ala Ala Val Val Leu. Ala Phe He He Trp Cys Leu Pro Phe 
260 265 270 

His Val Leu Thr Phe Leu Asp Ala Leu Ala Trp Met Gly Val He Asn 
275 280 2S5 

Ser Cys Glu Val He Ala Val He Asp Leu Ala Leu Pro Phe Ala He 
290 295 300 

Leu Leu Gly Phe Thr Asn Ser Cys Val Asn Pro Phe Leu Tyr Cys Phe 
305 310 315 320 

Val Gly Asn Arg Phe Gin Gin Lys Leu Arg Ser Val Phe Arg Val Pro 
325 330 335 

He Thr Trp Leu Gin Gly Lys Arg Glu Ser Met Ser Cys Arg Lys Ser 
340 345 350 

Ser Ser Leu Arg Glu Met Glu Thr Phe Val Ser 
355 360 


<210> SEQ ID HO 41 

<211> LENGTH: 2 0 

<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURES 

<223> OTHER IH FORMAT I OH : Primer 


US 2003/0224418 Al 


Dec. 4, 2003 


102 


-continued 


<4 0 0> SEQUENCE: 41 


actgcctgat aaocat^ctg 


20 


<210> SEQ ID HO 42 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 42 

atacttacac accagga.ggg 2 0 


<210> SEQ ID NO 4 3 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 4 3 

atgcctgctc caaaggcac 19 


<210> SEQ ID NO 4 4 

<211> LENGTH: 2 0 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 

<4 0 0> SEQUENCE: 4 4 

atgcctgctc caaaggcacc 2 0 


<210> SEQ ID NO 45 

<211> LENGTH: 21 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 

<400> SEQUENCE: 45 

atgcctgctc caaaggcaca t 2 1 


<210> SEQ ID NO 4 6 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 4 6 

tacttctggt tctctgagcg 2 0 


<210> SEQ ID NO 4 7 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 4 7 


actcaccttg aactcgtctc 


20 
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<210> SEQ ID HO 4 8 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER IN FORMAT I OH : Primer 
<400> SEQUENCE: 4 8 

tggttctctg agcgagtctt 2 0 


<210> SEQ ID NO 4 9 
<211> LENGTH: 21 
<212> TYPE : DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IK FORMAT I OK : Primer 
<4 0 0> SEQUENCE: 4 9 

tggttctctg agcgagtctt c 2 1 


<210> SEQ ID NO 5 0 
<211> LENGTH: 22 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 5 0 

tggttctctg agcgagtctt tc 22 


<210> SEQ ID NO 51 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 51 

tgcagatgga ctttggcttc 2 0 


<210> SEQ ID NO 52 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 52 

tgcttgcctt ctgctacaag 2 0 


<210> SEQ ID NO 5 3 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 5 3 

cttccctgag cacctgctg 19 


<210> SEQ ID NO 5 4 
<211> LENGTH: 21 
<212> TYPE: DNA 
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<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<4 0 0> SEQUENCE: 5 4 
cttccctgag cacctgctgg t 


<210> SEQ ID NO 55 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 55 

cttccctgag cacctgctga 2 0 


<210> SEQ ID NO 5 6 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 5 6 

aacagctcag gacgaaactg 2 0 


<210> SEQ ID NO 5 7 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 5 7 

agaaggagtt gaccttgtcc 2 0 


<210> SEQ ID NO 5 8 

<211> LENGTH: 19 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION : Primer 

<4 0 0> SEQUENCE: 5 8 

ggaagctcaa gtggccttc 19 


<210> SEQ ID NO 5 9 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 5 9 

ggaagctcaa gtggccttcc 2 0 


<210> SEQ ID NO 6 0 

<211> LENGTH: 22 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
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<4 0 0> SEQUENCE: 6 0 


ggaagctcaa gtggccttca ac 


22 


<210> SEQ ID HO 61 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 61 

aagtcactgg cagagctgg 19 


<210> SEQ ID NO 62 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 62 

gcaccagggc tttgttgaag 2 0 


<210> SEQ ID NO 6 3 

<211> LENGTH: 19 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<210> SEQ ID NO 6 4 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 6 4 

ttttccccgt agggctccac 2 0 


<210> SEQ ID NO 65 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 65 

ttttccccgt agggctccag c 2 1 


<210> SEQ ID NO 6 6 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 6 6 

tgcagaagtc actggcagag 2 0 


<4 0 0> SEQUENCE: 6 3 


ttttci 


:gt agggctcca 


19 
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<210> SEQ ID HO 6 7 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER IN FORMAT I OH : Primer 
<400> SEQUENCE: 6 7 

gttgaagttt tccccgtagg 2 0 


<210> SEQ ID NO 6 3 
<211> LENGTH: 19 
<212> TYPE : DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IK FORMAT I OK : Primer 
<4 0 0> SEQUENCE: 6 8 

actcctccac ctgctggtc 19 


<210> SEQ ID NO 6 9 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 6 9 

actcctccac ctgctggtcc 2 0 


<210> SEQ ID NO 7 0 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 7 0 

actcctccac ctgctggtct a 2 1 


<210> SEQ ID NO 71 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<210> SEQ ID NO 72 

<211> LENGTH: 2 0 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 

<400> SEQUENCE: 72 

agctctgcca gtgacttctg 2 0 


<4 0 0> SEQUENCE: 71 


aggacgtgcg tggcaacctg 


20 


<210> SEQ ID NO 7 3 
<211> LENGTH: 19 
<212> TYPE: DNA 
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<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<4 0 0> SEQUENCE: 7 3 
gtgacttctg cagcccctc 


<210> SEQ ID NO 7 4 
<211> LENGTH : 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artitifical sequence 
<4 0 0> SEQUENCE: 7 4 

gtgacttctg cagcccctca 2 0 


<210> SEQ ID NO 75 
<211> LENGTH: 22 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 75 

gtgacttctg cagcccctcg gt 22 


<210> SEQ ID NO 7 6 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 7 6 

cctgaccttc cagatgaag 19 


<210> SEQ ID NO 7 7 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 7 7 

tcaggttgcc acgcacgtc 19 


<210> SEQ ID NO 7 3 
<211> LENGTH: IS 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 7 3 

caggatctcg gccagtgc 18 


<210> SEQ ID NO 7 9 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<4 0 0> SEQUENCE: 7 9 
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caggatctcg gccagtgcc 


19 


<210> SEQ ID NO 8 0 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 8 0 

caggatctcg gccagtgctg 2 0 


<210> SEQ ID NO 81 
<211> LENGTH : 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 81 

acctgcgaga gcttcagcag 2 0 


<210> SEQ ID NO 82 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 82 

tctcoatgcg ctgtgcgtag 2 0 


<210> SEQ ID NO 8 3 
<211> LENGTH: IS 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 8 3 

agctgcgcac ccaggtca 18 


<210> SEQ ID NO 8 4 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE : 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 8 4 

agctgcgcac ccaggtcaa 19 


<210> SEQ ID NO 85 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE : 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 85 

agctgcgcac ccaggtcagc 2 0 


<210> SEQ ID NO 8 6 
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<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IN FORMAT I OK : Primer 
<4 0 0> SEQUENCE: 8 6 

tgtccaagga gctgcaggc 19 


<210> SEQ ID HO 8 7 
<211> LENGTH : 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 8 7 

cttacgcagc ttgcgcaggt 2 0 


<210> SEQ ID NO 8B 

<211> LENGTH: IS 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION : Primer 

<4 0 0> SEQUENCE: 8 8 

gcggacatgg aggacgtg 18 


<210> SEQ ID NO 8 9 

<211> LENGTH: 19 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 

<4 0 0> SEQUENCE: 8 9 

gcggacatgg aggacgtgc 19 


<210> SEQ ID NO 9 0 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 9 0 

gcggacatgg aggacgtgtg 2 0 


<210> SEQ ID NO 91 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 91 

gttgtagaaa gaaccgctgc 2 0 


<210> SEQ ID NO 92 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial sequence 
<2 2 0> FEATURE: 
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<223> OTHER IH FORMAT I OH : Primer 


<400> SEQUENCE: 92 


gagaacgagt cttoaggtac 


20 


<210> SEQ ID HO 9 3 
<211> LENGTH: 21 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 
<4 0 0> SEQUENCE: 9 3 

acaatctggg ctatgagatc a 2 1 


<210> SEQ ID HO 9 4 
<211> LENGTH: 22 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 
<4 0 0> SEQUENCE: 9 4 

acaatctggg ctatgagatc aa 22 


<210> SEQ ID HO 95 
<211> LENGTH: 2 3 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 
<400> SEQUENCE: 95 

acaatctggg ctatgagatc agt 2 3 


<210> SEQ ID HO 9 6 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 9 6 

cactctacac tgcatgtctc 2 0 


<210> SEQ ID HO 9 7 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 
<4 0 0> SEQUENCE: 9 7 

acccttctga aaaggagagg 2 0 


<210> SEQ ID HO 9 8 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<4 0 0> SEQUENCE: 9 3 
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gaggagagac aaggcagata 


20 


<210> SEQ ID NO 9 9 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 9 9 

gaggagagac aaggcagata t 2 1 


<210> SEQ ID HO 100 
<211> LENGTH : 22 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 100 

gaggagagac aaggcagata gt 22 


<210> SEQ ID NO 101 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 101 

aaaggttcag ttgctgctgc 2 0 


<210> SEQ ID NO 102 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE : 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 102 

gctggggaag gtctaataac 2 0 


<210> SEQ ID NO 103 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE : 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 103 

gttgctgctg cctcgaatc 19 


<210> SEQ ID NO 104 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 104 

gttgctgctg cctcgaatcc 2 0 


<210> SEQ ID NO 105 
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<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IN FORMAT I OK : Primer 
<4 0 0> SEQUENCE: 105 

gttgctgctg cctcgaatct g 21 


<210> SEQ ID NO 106 
<211> LENGTH : 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 106 

cgtctttctc cagatgatgc 2 0 


<210> SEQ ID NO 107 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : Primer 
<4 0 0> SEQUENCE: 107 

agtgtcctat gggctgtttg 2 0 


<210> SEQ ID NO 108 

<211> LENGTH: 21 

<212> TYPE: DNA 

<213> ORGANISM: Arti_Eicial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 

<4 0 0> SEQUENCE: 108 

ggatgccatt cataccttta c 2 1 


<210> SEQ ID NO 109 

<211> LENGTH: 22 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 

<4 0 0> SEQUENCE: 109 

ggatgccatt cataccttta cc 2 2 


<210> SEQ ID NO 110 
<211> LENGTH: 23 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 110 

ggatgccatt cataccttta cgc 2 3 


<210> SEQ ID NO 111 
<211> LENGTH: 2 0 
<212> TYPE: DNA 

<213> ORGANISM: Artificial sequence 
<2 2 0> FEATURE: 
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<223> OTHER IN FORMAT I OH : Primer 


<400> SEQUENCE: 111 


tgggaaaaca gtgcagtgtg 


20 


<210> SEQ ID HO 112 
<211> LENGTH: 20 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 
<4 0 0> SEQUENCE: 112 

tgatcgtctt cagaacgagg 2 0 


<210> SEQ ID HO 113 

<211> LENGTH: 2 3 

<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 

<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 

<4 0 0> SEQUENCE: 113 

ccagaccatc atcatcccat 99 a 2 3 


<210> SEQ ID HO 114 
<211> LENGTH: 21 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 


<210> SEQ ID HO 115 
<211> LENGTH: 22 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 115 

ccagaccatc atcccatgga gc 22 


<210> SEQ ID HO 116 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I OH : Primer 
<4 0 0> SEQUENCE: 116 

cagcaatcgt ctttctccag 2 0 


<210> SEQ ID HO 117 
<211> LENGTH: 2 0 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 


<4 0 0> SEQUENCE: 114 


ccagaccatc atcccatgga a 


21 


<4 0 0> SEQUENCE: 117 
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tcctatgggc tgtttgatgc 

<210> SEQ ID HO 118 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 118 
gtctttctcc agatgatgcc a 

<210> SEQ ID HO 119 
<211> LENGTH : 22 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Primer 
<4 0 0> SEQUENCE: 119 
gtctttctcc agatgatgcc aa 

<210> SEQ ID HO 120 
<211> LENGTH: 23 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER INFORMATION: Primer 
<400> SEQUENCE: 120 
gtctttctcc agatgatgcc agt 

<210> SEQ ID HO 121 
<211> LENGTH: 2 3 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<2 2 0> FEATURE: 

<223> OTHER IH FORMAT I ON : Primer 
<4 0 0> SEQUENCE: 121 
agcggataac aatttcacac agg 


<210> SEQ ID HO 122 
<211> LENGTH: 16 
<212> TYPE: DHA 

<213> ORGANISM: Artificial Sequem 
<2 2 0> FEATURE: 

<223> OTHER IN FORMAT I ON : Primer 


<400> SEQUENCE: 122 
ggcgcacgcc tccacg 


What is claimed: 

1. A method for detecting the presence or absence in a 
subject of at least one allelic variant of a polymorphic region 
of a gene associated with cardiovascular disease, compris- 
ing: 

the step of detecting the presence or absence of an allelic 
variant of a polymorphic region of a cytochrome C 
oxidase subunit VIb (COX6B) gene of the subject that 


is associated with high serum cholesterol or an allelic 
variant of a polymorphic region of a N-acetylglu- 
cosaminyl transferase component GPI-1 (GPI-1) gene 
of the subject that is associated with low serum high 
density lipoprotein (HDL). 
2. The method of claim 1, wherein the allelic variant is of 
a polymorphic region of the N-acetylglucosaminyl trans- 
ferase component GPI-1 (GPI-1) gene. 
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3. The method of claim 1, further comprising detecting the 
presence or absence in a subject of least one allelic variant 
of another gene associated with cardiovascular disease. 

4. The method of claim 3, wherein the other gene is 
selected from the group consisting of cholesterol ester 
transfer protein, plasma (CETP); apolipoprotein A-IV (APO 
A4); apolipoprotein A-I (APO Al); apolipoprotein E (APO 
E); apolipoprotein B (APO B); apolipoprotein C-III (APO 
C3); a gene encoding lipoprotein lipase (LPL); ATP-binding 
cassette transporter (ABC 1); paraoxonase 1 (PON 1); 
paraoxonase 2 (PON 2); 5, 10-methylenetetrahydro folate r 
reductase (MTHFR); a gene encoding hepatic lipase, E-se- 
lectin, G protein beta 3 subunit and angiotensin II type 1 
receptor gene. 

5. The method of claim 2, wherein the polymorphic region 
is a single nucleotide polymorphism (SNP). 

6. The method of claim 5, wherein the SNP is at position 
2577 of the N-acetylglucosaminyl transferase component 
GPI-1 (GPI-1) gene sequence and the allelic variant is 
represented by an A nucleotide in the sense strand or a T 
nucleotide in the corresponding position in the antisense 
strand. 

7. The method of claim 1, wherein the detecting step is by 
a method selected from the group consisting of allele 
specific hybridization, primer specific extension, oligonucle- 
otide ligation assay, restriction enzyme site analysis and 
single- stranded conformation polymorphism analysis. 

8. The method of claim 6, further comprising: 

(a) hybridizing a target nucleic acid comprising a 
N-acetylglucosaminyl transferase component GPI-1 
(GPI-l)-encoding nucleic acid or fragment thereof with 
a nucleic acid primer that hybridizes adjacent to nucle- 
otide 2577 of the GPI-1 gene; 

(b) extending the nucleic acid primer using the target 
nucleic acid as a template; and 

(c) determining the mass of the extended primer to 
identify the nucleotide present at position 2577, thereby 
determining the presence or absence of the allelic 
variant. 

9. The method of claim 1, wherein the detecting step 
comprises mass spectrometry. 

10. The method of claim 1, wherein the detecting step 
utilizes a signal moiety selected from the group consisting 
of: radioisotopes, enzymes, antigens, antibodies, spectro- 
photometric reagents, chemiluminescent reagents, fluores- 
cent reagents and other light producing reagents. 

11. The method of claim 8, wherein the nucleic acid 
primer is extended in the presence of at least one dideoxy- 
nucleotide. 

12. The method of claim 11, wherein the dideoxynucle- 
otide is dideoxyguanosine (ddG). 

13. The method of claim 8, wherein the primer is extended 
in the presence of at least two dideoxynucleotides and the 
dideoxynucleotides are dideoxyguanosine (ddG) and 
dideoxycytosine (ddC). 

14. A method for indicating a predisposition to cardio- 
vascular disease in a subject, comprising: 

the step of detecting in a target nucleic acid obtained from 
the subject the presence or absence of at least one 
allelic variant of polymorphic regions of a cytochrome 
C oxidase subunit VIb (COX6B) gene associated with 
high serum cholesterol or at least one allelic variant of 


polymorphic regions of a N-acetylglucosaminyl trans- 
ferase component GPI-1 (GPI-1) gene associated with 
low serum HDL wherein the presence of an allelic 
variant is indicative of a predisposition to cardiovas- 
cular disease compared to a subject who does not 
comprise the allelic variant. 

15. The method of claim 14, wherein the allelic variant is 
of a polymorphic region of the N-acetylglucosaminyl trans- 
ferase component GPI-1 (GPI-1) gene. 

16. The method of claim 15, wherein the polymorphic 
region is a single nucleotide polymorphism (SNP). 

17. The method of claim 16, wherein the SNP is at 
position 2577 of the N-acetylglucosaminyl transferase com- 
ponent GPI-1 (GPI-1) gene sequence and the allelic variant 
is represented by an A nucleotide in the sense strand or a T 
nucleotide in the corresponding position in the antisense 
strand. 

18. The method of claim 14, wherein the detecting step is 
by a method selected from the group consisting of allele 
specific hybridization, primer specific extension, oligonucle- 
otide ligation assay, restriction enzyme site analysis and 
single-stranded conformation polymorphism analysis. 

19. The method of claim 17, further comprising: 

(a) hybridizing a target nucleic acid comprising a 
N-acetylglucosaminyl transferase component GPI-1 
(GPI-1 )-encoding nucleic acid or fragment thereof 
with a nucleic acid primer that hybridizes adjacent to 
nucleotide 2577 of the GPI-1 gene; 

(b) extending the nucleic acid primer using the target 
nucleic acid as a template; and 

(c) determining the mass of the extended primer to 
identify the nucleotide present at position 2577, thereby 
determining the presence or absence of the allelic 
variant. 

20. The method of claim 14, wherein the detecting step 
comprises mass spectrometry. 

21. The method of claim 14, wherein the detecting step 
utilizes a signal moiety selected from the group consisting 
of: radioisotopes, enzymes, antigens, antibodies, spectro- 
photometric reagents, chemiluminescent reagents, fluores- 
cent reagents and other light producing reagents. 

22. The method of claim 14, further comprising detecting 
the presence or absence of at least one allelic variant of 
polymorphic regions of another gene associated with car- 
diovascular disease, wherein the presence of the two allelic 
variants is associated with a predisposition to cardiovascular 
disease compared to a subject who does not comprise the 
combination of allelic variants. 

23. The method of claim 22, wherein the other gene is 
selected from the group consisting of cholesterol ester 
transfer protein, plasma (CETP); apolipoprotein A-IV (APO 
A4); apolipoprotein A-I (APO Al); apolipoprotein E (APO 
E); apolipoprotein B (APO B); apolipoprotein C-III (APO 
C3); a gene encoding lipoprotein lipase (LPL); ATP-binding 
cassette transporter (ABC 1); paraoxonase 1 (PON 1); 
paraoxonase 2 (PON 2); 5, 10-methylenetetrahydro folate r 
reductase (MTHFR); a gene encoding hepatic lipase, E-se- 
lectin, G protein beta 3 subunit and angiotensin II type 1 
receptor gene. 

24. The method of claim 22, wherein the two allelic 
variants are of the cytochrome C oxidase subunit VIb 
(COX6B) gene and the N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene. 
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25. A method of screening for biologically active agents 
that modulate serum high density lipoprotein (HDL), com- 
prising: 

(a) combining a candidate agent with a cell comprising a 
nucleotide sequence encoding an allelic variant of a 
N-acetylglucosaminyl transferase component GPI-1 
(GPI-1) gene associated with low levels of serum HDL 
and operably linked to a promoter such that the nucle- 
otide sequence is expressed as a GPI-1 protein in the 
cell; and 

(b) determining the affect of the agent upon the expression 
and/or activity of the GPI-1 protein. 

26. A method of screening for biologically active agents 
that modulate serum high density lipoprotein (HDL), com- 
prising: 

(a) combining a candidate agent with a transgenic mouse 
comprising a transgenic nucleotide sequence stably 
integrated into the genome of the mouse encoding an 
allelic variant of a N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene associated with low 
levels of serum HDL operably linked to a promoter, 
wherein the transgenic nucleotide sequence is 
expressed and the transgenic animal develops a low 
level of serum HDL; and 

(b) determining the affect of the agent upon the serum 
HDL level. 

27. The method of claim 25, wherein the allelic variant is 
at position 2577 of the N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene. 

28. The method of claim 26, wherein the allelic variant is 
at position 2577 of the N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene. 

29. A method for predicting a response of a subject to a 
cardiovascular drug, comprising: 

detecting the presence or absence of at least one allelic 
variant of a cytochrome C oxidase subunit VIb 
(COX6B) gene of the subject associated with high 
serum cholesterol or at least one allelic variant of a 
N-acetylglucosaminyl transferase component GPI-1 
(GPI-1) gene of the subject associated with low serum 
high density lipoprotein (HDL); 

wherein the presence of at least one allelic variant is 
indicative of a positive response. 

30. The method of claim 29, wherein the allelic variant is 
of the N-acetylglucosaminyl transferase component GPI-1 
(GPI-1) gene. 

31. A method for predicting a response of a subject to a 
biologically active agent that modulates serum high density 
lipoprotein (HDL), comprising: 

detecting the presence or absence of at least one allelic 
variant of a N-acetylglucosaminyl transferase compo- 
nent GPI-1 (GPI-1) gene of the subject associated with 
low HDL; wherein the presence of an allelic variant is 
indicative of a positive response. 

32. A method for predicting a response of a subject to a 
biologically active agent that modulates serum high density 
lipoprotein (HDL) levels, comprising: 

(a) detecting the presence or absence of at least one allelic 
variant of a N-acetylglucosaminyl transferase compo- 
nent GPI-1 (GPI-1) gene associated with low HDL of 
the subject; and 


(b) detecting the presence or absence of an allelic variant 
in at least one other gene of subject associated with 
cardiovascular disease, wherein the presence of both 
allelic variants is indicative of a positive response. 

33. The method of claim 31, wherein the allelic variant of 
a N-acetylglucosaminyl transferase component GPI-1 (GPI- 
1) gene is at position 2577. 

34. The method of claims 32, wherein the allelic variant 
of a N-acetylglucosaminyl transferase component GPI-1 
(GPI-1) gene is at position 2577. 

35. The method of claim 32, wherein the other gene 
associated with cardiovascular disease is selected from the 
group of genes consisting of cytochrome C oxidase subunit 
VIb (COX6B); cholesterol ester transfer protein, plasma 
(CETP); apolipoprotein A- IV (APO A4); apolipoprotein A-I 
(APO Al); apolipoprotein E (APO E); apolipoprotein B 
(APO B); apolipoprotein C-III (APO C3); a gene encoding 
lipoprotein lipase (LPL); ATP -binding cassette transporter 
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2); 
5,10-methylenetetrahydrofolate r reductase (MTHFR); a 
gene encoding hepatic lipase, E-selectin, G protein beta 3 
subunit and angiotensin II type I receptor gene. 

36. A primer or probe that specifically hybridizes adjacent 
to or at a polymorphic region of a cytochrome C oxidase 
subunit VIb (COX6B) gene associated with high serum 
cholesterol in combination with a primer or probe that 
specifically hybridizes adjacent to or at a polymorphic 
region of a N-acetylglucosaminyl transferase component 
GPI-1 (GPI-1) gene associated with low HDL. 

37. The primers or probes of claim 36, further comprising 
primers or probes that specifically hybridizes adjacent to or 
at a polymorphic region of another gene associated with 
cardiovascular disease. 

38. The primers or probes of claim 36, wherein the 
polymorphic region of the cytochrome C oxidase subunit 
VIb (COX6B) gene comprises nucleotide 86 of the coding 
strand and the polymorphic region of the N-acetylglu- 
cosaminyl transferase component GPI-1 (GPI-1) gene com- 
prises nucleotide 2577. 

39. The primers or probes of claim 37, wherein the other 
gene associated with cardiovascular disease is selected from 
the group of genes consisting of cholesterol ester transfer 
protein, plasma (CETP); apolipoprotein A-IV (APO A4); 
apolipoprotein A-I (APO Al); apolipoprotein E (APO E); 
apolipoprotein B (APO B); apolipoprotein C-III (APO C3); 
a gene encoding lipoprotein lipase (LPL); ATP-binding 
cassette transporter (ABC 1); paraoxonase 1 (PON 1); 
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate r 
reductase (MTHFR); a gene encoding hepatic lipase, E-se- 
lectin, G protein beta 3 subunit and angiotensin II type 1 
receptor gene. 

40. A kit for indicating whether a subject has a predispo- 
sition to developing cardiovascular disease, comprising: 

(a) at least one probe or primer that specifically hybridizes 
adjacent to or at a polymorphic region of a N-acetyl- 
glucosaminyl transferase component GPI-1 (GPI-1) 
gene associated with low serum high density lipopro- 
tein (HDL). 

41. The kit of claim 40 further comprising instructions for 
use. 

42. The kit of claim 40, wherein the polymorphic region 
comprises nucleotide 2577 of the coding strand. 

43. A kit for indicating whether a subject has a predispo- 
sition to developing cardiovascular disease, comprising: 
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(a) at least one probe or primer which specifically hybrid- 
izes adjacent to or at a polymorphic region of a 
N-acetylglucosaminyl transferase component GPI-1 
(GPI-1) gene associated with low serum high density 
lipoprotein (HDL); and 

(b) at least one probe or primer which specifically hybrid- 
izes adjacent to or at a polymorphic region of another 
gene associated with cardiovascular disease. 

44. The kit of claim 43, further comprising instructions for 
use. 

45. The kit of claim 43, wherein the other gene associated 
with cardiovascular disease is selected from the group of 
genes consisting of cytochrome C oxidase subunit VIb 
(COX6B); cholesterol ester transfer protein, plasma 
(CETP); apolipoprotein A-IV (APO A4), apo lipoprotein A-I 
(APO Al); apolipoprotein E (APO E); apolipoprotein B 
(APO B); apolipoprotein C-III (APO C3); a gene encoding 
lipoprotein lipase (LPL); ATP-binding cassette transporter 
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2); 
5, 10-methylenetetrahydro folate r reductase (MTHFR); a 
gene encoding hepatic lipase, E-selectin, G protein beta 3 
subunit and angiotensin II type 1 receptor gene. 

46. A method of diagnosing a predisposition to cardio- 
vascular disease in a human, said method comprising the 
steps of: 

(a) obtaining a biological sample from the human; 

(b) isolating DNA from the biological sample; and 

(c) detecting the presence or absence of at least one allelic 
variant of a N-acetylglucosaminyl transferase compo- 
nent GPI-1 (GPI-1) gene in the DNA. 

47. The method of claim 46, wherein at least one variant 
is a G to A transversion at position 2577 of a N-acetylglu- 
cosaminyl transferase component GPI-1 (GPI-1) gene. 

48. A method of determining a response of a human to a 
cardiovascular drug, said method comprising the steps of: 

(a) obtaining a biological sample from the human; 

(b) isolating DNA from the biological sample; and 

(c) detecting the presence or absence of at least one allelic 
variant of a cytochrome C oxidase subunit VIb 
(COX6B) gene in the DNA or at least one allelic variant 
of a N-acetylglucosaminyl transferase component 
GPI-1 (GPI-1) gene in the DNA. 


49. The method of claim 46, wherein the detecting step is 
performed by an assay selected from the group consisting of 
allele specific hybridization, primer specific extension, oli- 
gonucleotide ligation, restriction enzyme site analysis, and 
single-stranded conformation polymorphism analysis. 

50. The method of claim 48, wherein the detecting step is 
performed by an assay selected from the group consisting of 
allele specific hybridization, primer specific extension, oli- 
gonucleotide ligation, restriction enzyme site analysis, and 
single-stranded conformation polymorphism analysis. 

51. A microarray comprising a nucleic acid having a 
sequence of a polymorphic region from a human N-acetyl- 
glucosaminyl transferase component GPI-1 (GPI-1) gene. 

52. The microarray of claim 51, wherein the polymorphic 
region comprises a locus selected from the group consisting 
of position 2577 of the human N-acetylglucosaminyl trans- 
ferase component GPI-1 (GPI-1) gene, position 2829 of the 
human GPI-1 gene, position 2519 of the human GPI-1 gene, 
position 2289 of the human GPI-1 gene, position 1938 of the 
human GPI-1 gene, position 1563 of the human GPI-1 gene, 
position 2656 of the human GPI-1 gene, and position 2664 
of the human GPI-1 gene. 

53. The microarray of claim 52, wherein the polymorphic 
region comprises position 2577 of the human N-acetylglu- 
cosaminyl transferase component GPI-1 (GPI-1) gene. 

54. A kit comprising: 

(a) at least one probe specific for a polymorphic region of 
a human gene selected from the group consisting of 
cytochrome C oxidase subunit VIb (COX6B); 
N-acetylglucosaminyl transferase component GPI-1 
(GPI-1); cholesterol ester transfer protein, plasma 
(CETP); apolipoprotein A-IV (APO A4); apolipopro- 
tein A-I (APO Al); apolipoprotein E (APO E); apoli- 
poprotein B (APO B); apolipoprotein C-III (APO C3); 
a gene encoding lipoprotein lipase (LPL); ATP-binding 
cassette transporter (ABC 1); paraoxonase 1 (PON 1); 
paraoxonase 2 (PON 2); 5,10-methylenetetrahydro- 
folate r reductase (MTHFR); a gene encoding hepatic 
lipase, E-selectin, G protein beta 3 subunit and angio- 
tensin II type 1 receptor gene; and 

(b) instructions for use. 


